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THE MOTION OF A FLUID IN A FIELD OF 
RADIATION 
By E. A. MILNE 
[Received 13 January 1930] 

1. Introduction. The hydrodynamics of a fluid moving in a field 
of radiation is identical, as regards the dynamical equations, with 
classical hydrodynamics, provided terms are introduced to allow 
for the stresses caused by the flux of radiation.* These, in general, 
form a stress-tensor, partly analogous to the stress-tensor of ordinary 
viscous motion. The radiative stress-tensor differs from the stress- 
tensor of viscous motion in that it does not reduce exactly to a simple 
hydrostatic pressure when the velocity-gradients are put equal to 
zero; tangential components of stress survive, proportional to the 
second differential coefficients of the radiant energy-density with 
respect to the optical thickness.t These terms are, however, negli- 
gibly small, under stellar conditions, compared with the principal 
components of stress, and to the same order the latter are all equal. 
\pproximately, then, if we neglect radiative viscosity and neglect at 
the same time small terms depending on the second-order tempera- 
ture gradients, the radiative stresses reduce to a hydrostatic pressure 
which we shall call p’. All we need to do, in forming equations of 
motion, is to add p’ to the ordinary fluid-pressure p. We must not, 
however, assume without proof that the same rule holds for integrals 
of the equations, and we shall find later an example (convective 
equilibrium) where the latter is not true. 

The equation of continuity is unaltered. 

This scheme of equations suffices for the discussion of a fluid at 
constant temperature. But when temperature variations exist or are 
brought about by the motion, and when the heating and cooling 
effects of the emission and absorption of radiation are taken into 
account, we require another equation evaluating the rate of change 


of the temperature at any point. Such an equation is fundamental, 


for example, in discussing the pulsations of a star or motions in sun- 
spots. The radical importance of this equation can be seen from the 
Rosseland has discussed terms arising from the finite velocity of light, 


but such terms are small compared with those here considered. 
+ These are fully discussed by the author in a contribution to Handbuch 


ler Astrophysik (in press). 
3695 B 








2 E. A. MILNE 
following example. In the classical mechanics of a dynamical system 
every particle is in equilibrium if at a given instant the accelerations 
of all the particles are zero. But this condition is not sufficient when 
temperature changes may occur. If D7'/Dt at any point is not zero, 
where 7' denotes the temperature and D/Dt is the Lagrangian opera- 
tor ‘following the motion’, the resultant heating or cooling may and 
usually will affect the dynamical stresses, and mechanical equilibrium 
will not continue to be preserved, in general. If x is a co-ordinate 
of a particle, D°x/Dt® may be non-zero even if D*x/Dé? and Da/Dt 
are zero. It can be shown that the displacement of a given particle will 
in general obey a differential equation of the third order in the time. 
This is borne out by Jeans’s investigations on the stability of stars. 

Unfortunately no consensus of opinion exists as to the form of the 
equation determining D7'/Di. A fundamental memoir by 8. Rosse- 
land,* which derives this equation, has been generally ignored. His 
method was to evaluate the energy flowing into a closed surface fixed 
in space and then t~ .pply a Green’s theorem transformation. His 
final equation neglects, however, the mechanical effect of radiation 
pressure in instantaneously compressing the medium. Eddingtont 
has frequently used an equation claiming to describe the ‘adiabatic’ 
motion of a fluid, but his equation treats an element of fluid as if it 
were enclosed in totally reflecting walls; it ignores the unequal flow 
of radiant energy through the different parts of the boundary of the 
element. Jeans{ has derived an equation for D7'/Dt by ‘following 
the motion’ of an element as it warms or cools, but he neglected the 
change in radiant energy imprisoned due to change of volume. His 
formula was corrected by H. Vogt,§ and the correction was accepted 
by Jeans,|| but Vogt’s method fails to take into account the difference 
between the flux of energy through a moving surface and that 
through a surface at rest. 

The following is an attempt to obtain the form of the D7/Dt 


equation in a rigorous way. The method differs from Rosseland’s 


chiefly in that it evaluates the energy-exchanges of the matter and 
aether inside a surface moving with the matter instead of inside a fixed 
surface. The result is an equation different from any yet given; but 

” Astrophys. Journ. 63, 342 (1926). 

+ M.N. 79, 13 (1918); Internal Constitution of the Stars, p. 190 (1926). 

t M.N. 85, 917 (1925). 

§ Ast. Nach. 232, No. 5545 (1928). 

|| Astronomy and Cosmogony, p. 114 (footnote) (1928). 
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it agrees with Rosseland’s when to his equation a radiation-pressure 
term is added. The accuracy of the equation is checked by using it 
to calculate the rate of change of the energy of a star which is con- 
tracting quasi-statically. 

An exact integral of the complete system of equations is then 
obtained, under certain conditions, in the form of a generalization 
of the Bernoullian energy-integral of classical hydrodynamics; it 
gives the change in flux of radiation between any two points on a 
stream-line when the motion is along the temperature gradient. 
Lastly an exact integral is obtained for ‘convective’ or ‘adiabatic’ 
equilibrium when account is taken of radiation-pressure. It is usually 
assumed that under radiation-pressure the well-known adiabatic law 
po py becomes p-+p’ x p”, where p is the density, p the gas-pressure, 
p’ the radiation-pressure, y and y’ ratios of specific heats. This is 
shown to be false. The temperature distribution in a column in 
convective equilibrium under gravity and radiation-pressure is 
obtained, and it is shown that as the depth increases, the density 
tends to a finite maximum. Some numerical application to solar 
conditions is made. The temperature distribution in convective 


equilibrium is shown to be a particular case of the Bernoulli energy- 


integral. 

2. Lemma. We shall need the following lemma. Consider a fluid 
in motion in any manner, either confined within a fixed boundary 
or with its density vanishing everywhere at the boundary. Let there 
be no external masses, so that the gravitational potential is derived 
from the fluid itself. Let V be the gravitational potential at any 
point, p the density, x, a typical space-co-ordinate (one of 2, y, 2), 
uw, the corresponding velocity-component (one of u, v, w), t the time, 
do an element of volume. Then 


' oV eV 
{Il p at do = {|| Puy én, do, 


where the volume integrals are taken through the whole fluid and the 
repetition of the suffix « implies* summation over the corresponding 
three suffixes. 
Proof. By the equation of continuity, 
c é 
= + Aa, (pu) =O 
* Here and elsewhere. 


B82 





4 E. A. MILNE 
The potential V at x, is given by 


r=([fé 


where 2 = (w—a' P+ (y—y')?+(2—-2') 
and p’ denotes the density at x,. Hence 


l 
\\ ea : ; (p’ u,,) do’ 
r 
0 [pux\., a. a: : 
lose 
; sii ‘ 
Q’ | , , 
= yy di \\| p Wada? 


on transforming by Green’s theorem and denoting by n, a typical 
direction-cosine of the outward normal at the surface element dS’. 


On the hypotheses made the surface integral vanishes. Hence 


pep eV a a Pes. 1 ” 
\J. it Sess | pdo|| p Me Ga , ao. 


Interchange the order of integration on the right-hand side. Then 


{{ d | | OO b. l 

ae ao p’ Wy ao ‘| Pp F a, - ao 
: C2 

ir p "Wy do’ by. bi , d 

| Pp Uy 

= | puy m do 


This establishes the lemma. 
Corollary. Let dW/dt be the rate of performance of work by the 
gravitational forces on the fluid. Then since a typical force-com- 





ponent is peV /éx,, 
dW 
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on applying the lemma. Multiply the equation of continuity (1) by 
SV’, integrate over the volume of the fluid, and add to (4). We find 


dw re eT 
{| E (eV )+ an, u.) | * 


But if d is any function of position, by a well-known theorem 


Alem (zeee]s 


where D/Dt on the left-hand side denotes differentiating following 
the moving surface. Hence 


dw OD . : 
di call dV do. (6) 


By integration of (6) with respect to the time we have a proof of 
a well-known formula in attractions, namely, that the gravitational 
potential energy is —}{{{pV do. We must not, however, attach any 
physical meaning to the localization of gravitational energy appar- 
ently implied by the existence of this integral; any number of 
different volume integrals for the energy can be given, each one 
apparently implying a different localization of energy. What is of 
physical importance is the localization of the appearance of energy 
when the configuration is changing. This is given by (3) whatever 
the bounding surface drawn in the fluid. Our result (6) verifies that 
this localized appearance of energy, integrated over the whole volume 
of the fluid, is equal to the rate of decrease of the potential energy. 

3. The conservation of energy for a fluid moving under its 
own stresses, in a field of gravitation and in the presence of 
radiation. Let EF be the radiant energy per unit volume at any 
point, C7 the thermal energy of the fluid per gramme, 7' the 
temperature. Then H+-pC,, 7 is the total thermal energy contained 
within unit volume at any point. For perfect gases or for only 
slightly compressible liquids C,, may be taken as constant or a 
function of temperature only ; it is then the specific heat at constant 
volume. When dissociation may occur, the above specification may 
be considered as the definition of C,. Until we come to discuss 
perfect gases we shall not need to separate C,, and 7’. 

Let « be the rate of liberation of sub-atomic energy at any point, 
per gramme. Let p be the hydrostatic pressure, p’ the radiation- 
pressure, F, F,, F, the components of the net flux of radiation. 


x” 





6 E. A. MILNE 
The radiation-pressure p’ is equal to 4#, but we shall not use this 
relation until we are compelled, in order to trace the identity of the 
various terms. Let V be the total gravitational potential, not neces- 
sarily wholly due to the fluid under discussion. 

Consider a surface S moving with the fluid, i.e. such that it con- 
sists, during the motion, of the same fluid-elements, reckoned macro- 
scopically. The rate of change of the energy contained within this 


surface, thermal and kinetic, is 


D 7 Y al 
pal | [eto » l'+dpu,u,) do; 


The rate of flow of energy into the space enclosed by S consists 
of four contributions :* (1) the rate of liberation of sub-atomic energy 
[Jf ep do; 

(2) the work done by the total pressure, p+ p’, at the surface, 
- {fp +p’)n,u, aS ; 


(3) the work done by the gravitational forces 


oV lo: 
Pra oy OO» 


(4) the rate of flow of radiant energy into the surface. If the surface 
were at rest, the flow would be 

—f{n ds, 
but the flow through a moving surface will be different. The simplest 
way of calculating the correction is to notice that as compared with 
a surface at rest the flow in dt will be greater by the radiant energy 
contained within the volume through which the surface S expands. 
Hence there is an added rate of flow of amount 

ff Hn.u, ds. 
Hence 


D . 
mil | { +pC,, T+ 3U.Uq) do || do — [Jv +p’) nu, aS+ 

eV _— 

= \\| Puter Bxe do - \| nF, dS + \| En,u,d8. (7) 


* We neglect conduction of heat. This could easily be taken into account 


if necessary, but it is negligible in stellar applications. 
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lt seems at this stage desirable to point out that we must avoid 
the error of attempting to count in also the rate of change of the 
potential energy of the fluid contained within S. If the surface 8 
contained the whole fluid we could if we liked ignore the work done 
by the gravitational forces on the volume elements, and calculate 
instead the rate of increase of the whole potential energy. If the 
gravitational field were derived entirely from the fluid under con- 
sideration, we could delete the term {{{ pu,(@V/éx,) do on the right- 


hand side of (7) and add in a term a ||| te" do on the left-hand 


side. Equation (7), thus altered, should remain valid. The corollary 
to the lemma shows that this is so. For taking (7) as it stands we 
can transfer the third integral on the right to the left, obtaining 

-[{{ pu,(@V/ex,) do, and by (4) and (6) this integral, when extended 


to the whole fluid, is precisely | |] be" a: The upshot of this 


parenthesis is that in forming (7) we have accurately calculated the 
rate of liberation of energy, locally, due to changes in the gravita- 
tional configuration, whatever the surface S; the quantity « repre- 
sents sub-atomic liberation only—it does not include gravitational 
liberation or convective liberation, which are taken into account 
elsewhere in the equation. 

We now make use of the dynamical equations of motion 


Du a(p+p’) eV 
t= — | 8 
P'Dt ox, | dx, (8) 


to remove the terms in @V/éa, from (7). We then apply (5) to trans- 
form the left-hand side of (7), and we transform the surface integrals 
on the right-hand side by Green’s theorem. The result is 


|| =| E+pC,7T+4pu,u,|do+ 
C 


*{I) 2 [up{# tpC, T+ $pU,t,}]do = 
OX 


\\| ep do -{\l =" (p+p’)u,|do -\I| “ F do + 
Own Du, , @ 4 
+ {jl ag, (Be) do +{{] E - 4 “re | tn 








E. A. MILNE 
Since the surface S is arbitrary, we can now remove the sign of 
volume integration which occurs throughout. Using 


D c a c 


and remembering that « and f are dummies, we obtain 
aR Pa ps as Al 4 P 
ok LOT Op _, pug) Pn 0) Bee > rs. (pup) “a 
ct ot OX Dt ai ct xg 
OF » OU 
<_p— —*—(p+p') —* 


OL yy CXL y 


Applying the equation of continuity (1) and using it also in the 
form Dp tty ‘ 
Dt Pox, _ 
we have finally 

7 Y spy , 
oe ao yt) pp Dp adie. (9) 
ot Dt p Dt 

Equation (9) is the desired equation for D7'/Dt which completes 
the system of equations determining the motion. It should be noticed 
that E is preceded by the Eulerian operator @/ét, not the Lagrangian 
D/Dt. This is important in applications. Explicit reference to 
gravitational energy has disappeared, but it is fully counted in. The 
equation contains no reference to kinetic energy, though it is by no 
means obvious, when gravitational contraction is occurring, that no 
interchange occurs between kinetic energy and thermal energy ; the 
method of proof of Jeans and Vogt, even if it led to the correct 
result, seems open to objection in this respect. 

4. We first give a physical meaning to formula (9). Integrate both 
sides over any finite volume surrounded by a fixed surface. The 
right-hand side transforms to 

fff —pdo — f{ nF ds. (10) 

This represents the total output of sub-atomic energy less the amount 

leaving the surface in the form of radiation. It follows that the 
quantity 

oH, D(C,T) p+p' Dp 


4 11 
a? De p Dt (11) 


represents the net gain of liberated sub-atomic energy per unit 
volume less the net loss by radiation. 
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5. Quasi-static contraction of a star. We apply this to a 

spherical star, contracting quasi-statically. From the result of the 
last paragraph, the net rate of gain of energy in the whole star is 


oE  D(C,T) p+p' Dp 
+ ——— = 10. 
ING oe et ae 


This should be equal to the rate of change of the energy of the star 
calculated in the usual manner. I have shown elsewhere* that the 
gravitational energy of the star is 


—3 [{f(p+p’) do. (12) 


The thermal energy is 

fff (2+p¢, T) do. (13) 
Now we have always EH = 3p’. Hence, adding (12) and (13) the total 
energy is 

{ff (pC, T—3p) do. (14)+ 
Accordingly, for a spherical star contracting quasi-statically we ought 
to have 


D sige cE , D(C,T) p+p'Dp e 
= C., 1 —3p) do = pw hl I Jo. (li 
mall —— ne °~ De ; a 


The satisfaction of this identity will be a test of the correctness of 
the fundamental equation (9). To verify it, we write 


oE DE 0H 
= u 


= — . 16 
ot Dt Or (16) 
where u is the (slow) radial velocity at distance r from the centre. 


Since do = 4rr? dr, we have on integrating by parts 


aon ‘ 9 
-\|] u os do = [—4nr?u \+| B(= “ =) Arr? dr. 
or or 6 


The integrated part vanishes at both limits, neglecting, as in the 


* M.N. 89, 740 (1929), equation (6). 
+ We notice that for a star of perfect gas, obeying the equation of state 
p (R/p)pT, for which C,(y—1) R/p, y being the ratio of specific heats, 


expression (14) reduces to 
4—3 
\f| ’> do. 
y—l1 


This expression, though containing no explicit reference to the radiant energy, 
fully includes it. It remains true if y and p are variable from place to place 


in the star. 














10 E. A. MILNE 
demonstration of (12), the relatively small surface-value of #. The 
equation of continuity is here 
1D 2 ( 
See oe 4 a, (17) 
pDt' r ' or 
Hence the right-hand side of (15) becomes 
DE EDp. D(C,T) p+p'Dp 
aaa : —— 7 2 re 18 
\( Dit ppt? Dt ; ae (18) 
Now in quasi-static motion we have 
1a(p+p’) GM(r) 


19 
p or r2 (19) 


> 


where G is the constant of gravitation and M(r) is the mass contained 
in radius 7. Apply the operator D/Dt to this equation. We find 


~ pe or =6Dt Tp or Dt p or or 
Substitute for GM(r)/r? from (19) in (20) and use (17). We find 


1d(p+p')Dp _1éD(p+p') 1e(p+p')eu G M(r) ou. (20) 


pe sta 


&D(p+p’) . 4ue(p+-p’) 

o Dt 'r or 
Multiply (21) by 4z7r3 dr, and integrate through the star. Integrating 
by parts and again using the equation of continuity (17) we find 


3 "D(p+p’) 
; Dt 


= 0, (21) 


p-+p’ Dp 


Pe eee 29 
Di 4ar? dr = 0. (22) 


4nr2 dr — a 


18) replace the term in DE/Dt by the term in 3Dp’/Dt given 
). Then (18) becomes 


gPtP Dp _ EDp ot a: me .*) 4rr? dr 
ae ee whe ee 


o((P Dp Dp D(C, T) “ 
or a|(2 Dt — Dt do + p Dt do. (23) 
In the first of these integrals substitute for Dp/Dt from the equation 


of continuity. It becomes 


of (PP... a:. 
-3((7 +p div u) do, 


u being the vector velocity; this is simply 


3D 
or \ do. (24) 


n ( 
Ja 
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Since pdo is conserved during the motion, the second integral in (23) 
may be written 


D 
Di Ce do. (24’) 


Adding (24) and (24’) we find that the right-hand side of (15) is 


| ae 
Al (pC, T'—3p) do, 


which is precisely the left-hand of (15). This check is one which any 
proposed equation of the type (9) must satisfy. 

6. Comparison with results of other investigators. The 
formula of Jeans,* as corrected by Vogt,} runs in the present notation 


DE, D(C,T) (p+p' +E) Dp _ 


ee = — iv 9’ 


This differs from our result (9). But it will be observed that the 
left-hand side of (9’) is identical with the integrand of (18). It follows 
that (9’) satisfies the test we have just been applying. Thus (9’) is 
correct on the average, for a star contracting quasi-statically. Equa- 
tion (9’) corresponds to a different partition of the radiant energy 
amongst the volume elements. Reference to Vogt’s proof will show 


that in the deduction of (9’) the flow of radiation through a moving 
surface has been taken as identical with that through a fixed surface. 
Xosseland’st formula in the present notation is 
1S ee as. ; ; ? 
at +5 (eC ,1')+div (pC,,Tu)+p div u = ep—div F. (9”) 
Introduction of the operator D/Dt and use of the equation of con- 
tinuity converts (9”) into (9) provided p is replaced by p+p’. In 
Rosseland’s derivation, however, p denotes (as with us) the ‘hydro- 
static part’ of the material stress-tensor, so that the mechanical 
effect of the radiation-pressure has been ignored. 
I have found the simple form (9) more convenient than Rosseland’s 
in making actual calculations. 
7. The analogue of the Bernoulli energy-integral. Consider 
now the case of steady motion in a region in which the liberation of 


* M.N. 85, 917 (1925), equation (6); see also Astronomy and Cosmogony, 
p. 114 (1928), equation (105-4) and footnote. 

+ Ast. Nach. 232, No. 5545 (1928), equation (11). 

{ Astrophys. Journ. 63, 355 (1926), equation (37). 





12 E. A. MILNE 
sub-atomic energy ¢« is zero. Then since EF at any point is now 
constant in time, 0H#/ét is zero, and (9) reduces to 
Fit p’ Dp _ ey (25) 
Dt p Dt 

Now consider the special case in which the streamlines coincide 
with the curves of flux defined by the vector F, i.e. the special case 
in which the flow is along the temperature-gradient. (The upward 
flow in a sun-spot is of this type.) Let o be the cross-sectional area 
of a given elementary tube of flow at any point, s the arc measured 
along the axis of the tube. Then if g denotes the scalar velocity of 
the fluid, Di = ds/q, and 

d 


; l . 
div F ca Fo), 


where F = |F|. Hence (25) becomes 
d(C,,T) p+p'dp i i 
4 E ds p Z| ~ eds Fey. 
Now the equation of continuity is here 


pqo = constant = f, say. 


Hence (26) may be written 


[aC,T) ptp'dp] <d,_ 
s[ ds a p” |= ~g"*- 


Now the equation of motion is here 
dq dV d(p+p’) 
Plas ds ds 
dq dV d/p+p' p+p'dp 
7 lds ds ds p p® ds° 


Multiplying (28) by f and adding to (27) we find 


[a(c,T) , dq dV, d/p+p'\] _ 
J ds ‘ds ds | p - a 


This integrates in the form 
t 


| C T+3¢—V+" : : ] +Fo=C, (30) 
v) 


where C is constant along the streamline. Dividing by f and sub- 
stituting its value, (30) becomes 
* ‘ 7 
eae ,, pp’ F 
C,T+4¢—V+ 
ppg 


= C", (31) 
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where C’ is constant along the streamline. Equation (31) is the ana- 
logue of the Bernoullian energy-integral of classical hydrodynamics 
when account is taken of the flux of radiation and radiation-pressure. 

Equation (31) may be considered a fundamental relation in the 
theory of sun-spots. Here we may consider the motion rectilinear 
and vertical, approximately, and so here pq is constant. Also 


dV /dx = —g, x being the height, so that V = —gx. Thus (31) becomes 
q{pC,, T+3p9q?+-gpx+-p+p’]+F = constant. (32) 


This equation expresses the constancy of the sum of the fluxes of 
energy of all kinds past any point.* Thus it determines the difference 
in radiation-flux between the top and bottom of a sun-spot. An 
approximate form of (32) has been given by Rosseland,} and applied 
by him to sun-spots, but Rosseland only stated the result in a form 
involving undefined means. The numerical application of (32) to 
sun-spots will be given elsewhere. 

8. Adiabatic motion in a star. For a spherically symmetrical 
star we may substitute for 0#/ét in terms of DE/Dt by means of (16). 
Equation (9) then becomes 


DE _ 0H , DC,T)_p+p'De_ .,_ aie. (33) 


Dt dr" Dt p Dt 


If we neglect the heating and cooling by the absorption and emis- 
sion of radiation, and the heating by liberation of sub-atomic energy, 
in comparison with the changes of temperature produced by convec- 
tion and by compression and expansion, we have what Eddington 
calls the ‘adiabatic’ motion of stellar material. The assumption is 
equivalent to putting the right-hand side of the fundamental equa- 
tion (9) equal to zero, and accordingly (33) becomes 


DE_2E , D(C,T)_ p+p'Dp_,, (34) 


—u 
Dt or” Dt p Dt 
Describe by 5 the changes occurring in an interval d¢ following the 
motion. Then wdt = dr, the radial expansion, and (34) becomes 
9+ p’ 


sE— — 8r-+p3(C, T)—! 
p 


cr 


dp = 0. (35) 


This equation may be compared with the one given by Eddington 
for adiabatic motion. Considering each element of material as if 


* As does (30). + Loe. cit., equation (53). 
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enclosed ‘adiabatically’, i.e. in totally reflecting walls, he finds* in 


p p 


This gives, on multiplication by p, 


our notation 


ptp' +H 
P 
This differs from (35). Equation (35’) is equivalent to the Jeans- 
Vogt equation (9’) for the case of adiabatic motion, and so gives the 
right change in energy when integrated over the whole star. But it 
corresponds to a faulty allocation of radiant energy amongst the 
volume elements. It assumes in effect convection of radiant energy 
with the material. But we cannot legitimately regard the volume 
elements as enclosed in totally reflecting walls. Equation (35) shows 


SB+pd3(C, 7’) ip @. (35’) 


that Eddington’s formula requires correction for the gradient of 
energy-density. 

The simplest way of seeing that the Jeans-Vogt formula, and with 
it Eddington’s formula, implies convection of radiant energy with the 
material is to derive the Bernoullian energy-integral on the basis of 
the Jeans-Vogt formula. If we repeat the analysis of section 7, using 
(9’) instead of (9), we find for the integral 

; at 

|= +0, T+14¢—V 4 ptP'|+ Fo =C (30’) 

p p 
instead of (30). The term fH/p would represent convection of radiant 
energy ogZ. Actually under the conditions of the problem the dis- 
tribution of radiant energy remains stationary ; there is no convection 
of radiant energy, but simply the radiation flux Fo. Equation (30) 
could in fact be written down from first principles; the importance 
of the formal proof of section 7 is that it confirms the correctness 
of (9) and disproves the Jeans-Vogt formula. The significant point is 
the occurrence of @/ét, not D/Dt, in front of £ in (9), and it is strong 
support for the correctness of (9) that Rosseland’s method, using a 
fixed surface, also yields 0F/ét. 

9. Convective equilibrium under radiation-pressure. Our 
last application of (9) is to the type of distribution which is known 
(curiously enough) as ‘convective equilibrium’. This is the type of 

* Internal Constitution of the Stars, p. 190 (1926), equation preceding 
(129—2). We have simply replaced his V, which may be taken to be the specific 
volume, by 1/p. 
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equilibrium when the material.is being constantly stirred in such 
a way that the resulting convection currents are so rapid that the 
heating and cooling by compression and expansion is dominant com- 
pared with the heating and cooling by absorption and emission of 
radiation.* 

Since the state is steady, 0#/ét = 0, and (9) reduces to 
(C, T +p’D . 
CT) _ptes P= ep—div F. (37) 
Dt p Dt 
If q is the velocity, Ds the element of path of any element during 
the stirring, Dt = Ds/q and (37) becomes 
(CT +p’ D ] , 
Pe C.7)_pte er. ~(ep—div F). (38) 
Ds p Ds q 
On the hypothesis of convective equilibrium, ¢ for any small element 
is relatively so large that the right-hand side of this equation may 
be neglected in comparison with the left-hand side. Also Ds is 
random in direction. Hence (38) reduces to 


pd(C, T)—PT?P dp =0. (39) 
Pp 


Equation (39) is the fundamental differential equation defining 
convective equilibrium. It needs of course to be combined with the 
equation of mechanical equilibrium of the material as a whole before 
the space-temperature distribution becomes definite. But equation 
(39) as it stands defines a relation between p and 7' for any given 
equation of state. It is usually assumed without proof, on the analogy 
of the corresponding differential equation when p’ = 0, that in the 
case of a perfect gas the integral of (39) is of the form 

p+p' x pY, (40) 
where y’ is a suitably defined ratio of specific heats. I propose to 
show that this assumption is false. We proceed to integrate the 
differential equation (39) when the equation of state is that of a 
perfect gas. 

We have always p’ = taT". (41) 

* ‘Stirring’ of course implies the introduction of external forces not 
taken into account in deducing (9). But other definitions of convective 
equilibrium which may be framed (e.g. the state after stirring) will be found 


to imply (39). The method of deduction of (39) from (9) is due to Rosseland, 


loc. cit., p. 355. 
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For a perfect gas, with ratio of specific heats y, gas-constant R, and 
molecular weight », we have the equations 
p= (R, p)pT’, (41’) 
R ” 
(41”) 
(y—1)p 


Inserting these in (39) we find 


. par —| = ie 3a | dp 
(y—1)p a 


or R i vial | dp = 10%. (42) 
ayes". 7” 
Multiply both sides by p%v-*. Then both sides become perfect dif- 


ferentials and we find 


1 
Pa 


= € 
3y—4 


1(p3v-4). 


Integrating we have 


1\3 
= 7 A r py 's (43) 
3(y—l)p\ 7 3y—4 


where C is a constant. 
By means of (41) and (41’), (43) may be put in either of the 
equivalent forms 
] C 1 
a = (44) 


3(y—1)p’ | 8y—4 Jap 
to F = o() (f). (44’) 
3(y—1) | 8y—4 p R (7) 
No value of y’ exists, either a constant or a function of 7’, for which 
(44) or (44’) can be put in the form (40). 
The case of negligible radiation-pressure is obtained by putting 
a = 0 in (43) or p’ = 0 in (44’). We have then respectively 
Ta, 
pc py”, 
the usual results. We see that the presence of radiation-pressure 
makes a fundamental difference to the integrated adiabatic relation ; 
we cannot simply add the radiation-pressure to the gas-pressure. 
If y > 4, (44) shows that p/p’ decreases as p increases; (44’) then 
shows that p/p” increases as p increases. Thus as we pass down 
a column in convective equilibrium, radiation-pressure p’ becomes 
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increasingly important relative to gas-pressure p; p/p’ is not even 
approximately constant. 

10. The convective equilibrium of a vertical column of gas 
under radiation-pressure. An atmospheric column of perfect gas 
in convective equilibrium under gravity, in the absence of radiation- 
pressure, is well known to possess a temperature distribution in which 
the temperature decreases linearly with the height. We proceed to 
determine the change brought about by radiation-pressure. 

The equation of mechanical equilibrium for a vertical column under 
gravity g is 

d(p+p') _ —» (45) 
dx / 
where x is the height measured vertically upwards. This may be 
written 
ale )+Pe? in ~%. (46) 
dx\ p p® dx 
The differential equation of convective equilibrium, (39), may be 
written in the form 
QO,7)_ P19 @ _ 9, (47) 
dx gp az 


Adding (46) and (47) we find on integration 


CO, T4+PTP — —g(x—A), (48) 
p 


where A is a constant. 

Equation (48) is the general equation giving the space-temperature 
distribution in convective equilibrium. By its mode of derivation, 
(48) is seen to be a particular case of the Bernoullian energy-integral 
(32). For in convective equilibrium the flux of macroscopic kinetic 
energy is zero, and the radiative flux is disregarded. Removing from 
(32) the terms in g? and F we obtain (48). I do not remember to 
have seen this aspect of convective equilibrium mentioned previously. 
It will be noticed that in deriving (48) we have used no equation of 
state; we have simply used the general differential equation of con- 
vective equilibrium, not its perfect-gas integral. 

For a perfect gas, the use of (41), (41’), and (41”) transforms (48) 
into 

y BT eT" 
y—lp p 
c 


—g(x—A). (49) 
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If in this we neglect the term ja7", we obtain 7' as a linear function 
of x, the classical result. 
To discuss the form of distribution implied by (49) we now use the 
integral convective relation (43) in the form 
= c+ a Cp*-*y, (50) 
3(y—1)u TT?  3y—4 


and substitute for p/7'? in the second term of (49). We find 





= —g(x—A). 


This can also be put in the form 
RT [4 3(y—i)? ‘ 
=n (y—3) — —g(v—A). (51’) 
w(y—1)L3 (3y -4)—(3a/¢ )p*” ' 
We may confine our attention to the case y > 4. At great heights, 
p is small, and the temperature distribution is approximately linear 
in z. As we descend, p increases, but it never exceeds the limiting 


value pmax given by 
Y 


3y—i = _— 
max ha 


(3y—4). 


This value is approached asymptotically as x -> —oo. For this limit- 
ing value, pY-1/T7' is zero by (43), and so, since p is finite, 7’ is infinite. 
Equation (51’) shows that at great depths 7' increases with —< less 
rapidly than linearly. 

In terms of pyax, (44) and (51’) become 


VE +1 (Pas) (53) 

3(y—1)p" p 

71 a 
= st Tew |= 9-4). (54) 
H(y—1)L3 1—(p/Pmax) y 

11. Numerical application to solar conditions. A sun-spot is 
sometimes supposed to be a region of convective equilibrium. It is, 
however, rather a region of convective ascent—mass motion in which 
cooling by convection occurs according to equation (32). Thus the 
concept of convective equilibrium does not apply. Convective equi- 
librium can only be established by some external mixing agency, and 
it is not clear that any such agency exists on the sun, even in sun- 
spots. Nevertheless, any turbulent motion will tend to set up a 
temperature-distribution striving to approach that of convective 
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equilibrium. Just as in the case of the earth’s atmosphere, the actual 
state of affairs will be intermediate between radiative equilibrium 
and convective equilibrium. Radiative equilibrium on the sun has 
been thoroughly studied. It becomes of interest to study numerically 
the other extreme, convective equilibrium. 

For definiteness take y= 3. Then 3y—4=1 and 3(y—1)=2. 
Consider a column in convective equilibrium for which the conditions 
at the top correspond to those in the umbra of a sun-spot. Let the 
suffix 0 refer to this level. We shall adopt 

Po = 5X 10-5 atmos. = 50 dynes em.~?, 7', = 4,500°. 
Then if we assume for definiteness a mean molecular weight » = 20, 
with R= 8-26 x 107, we have p,= 2:69 x 10-® gm. cm.~*. Further, 
with a= 7-64 10-4, we find pj) = 1-04 dyne cm.-*. Introducing 


these values in (53) we find 


Pmax BY-4 79 -8 -3 
== 2! Pmax = 6-72 < 10-° gm. cm.~*. 


= 2 ), 
Po 

To obtain an idea of the rate at which p approaches pmax as we 
descend, we will now determine the level where p = }p,x- Let the 

suffix } denote this level. Then by (53) 

P;|P3 — 2, 
whence, using the value p,=4x6-72x10-° gm. cm.-*, we find 
T’, = 30,000°. To find the depth in kilometres we use (54). The 


depth x)—a, is given by 


. = . == 7 ~-|-2 ae 9 ; 
— iL i( | i 1(5+ 2425) 


= 120 Km., 


on inserting g = 2-74 x 104 cm. sec.~*. 

Thus under the conditions of convective equilibrium, in a depth of 
120 Km., with »= 20, the temperature increases from 4,500° to 
30,000°, and the ratio of gas-pressure to radiation-pressure diminishes 
from 48 to 2. In radiative equilibrium, on the other hand, the ratio 
of gas-pressure to radiation-pressure changes only very slowly in the 
photospheric layers of a star.* In the photospheric layers on the sun, 
for 4 = 20 as above, the gradient of temperature is about 150° per 
Km., and a temperature increase of 25,500° would be only achieved 
in 170 Km.; the mean gradient in the corresponding convective 

* See Bakerian Lecture, Phil. Trans. 228A, 456 (1929). 


c2 
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region is about 215° per Km. If the photospheric temperature is 
taken as 6,000°, the temperature in the convective column will over- 
take that in the radiative column at a depth of 25 Km., where the 
temperature is 9,500°. This is where we should place the base of 
the spot-vortex* if it could be assumed to be in convective equi- 
librium. Actually, since the gradient in the ascending gases in the 
spot will be intermediate between radiative and convective equili- 
brium, the base of the spot-vortex will be deeper than this. Rosseland 
has given an estimate of 50 Km. for the thickness of the spot-vortex, 
and this is in agreement with some unpublished calculations of my 
own. The above calculations are not intended to represent sun-spot 
conditions—as remarked above the latter will be governed by equa- 
tion (32)—but they give a concrete picture of convective equilibrium, 
showing in particular the rapid approach to a limiting density. 

12. Disproof of convective equilibrium in a star as a whole. 
If the material forming a star were rapidly stirred throughout, so 
that convective equilibrium obtained momentarily, p and 7’ would 
be connected by equation (44). Hence nowhere in the star could the 
density exceed the value p,,,x given by (52). From our numerical work 
in section 11, a star with a pressure of 5x 10-° atmos. in its photo- 
sphere and an effective temperature of 4,500° would have a mean 
density less than 10-’ gm. cm.~* for » = 20 and a still smaller one 
for smaller ». The actual numerical values are of small importance, 
but the mere order of magnitude of the result shows that stars as 
a whole cannot be in convective equilibrium or in a state in any way 
approaching it; the observed mean densities are too large. In con- 
vective equilibrium they would be orbs of immensely greater radius. 
Formula (44) affords a justification, in the simplest possible way, of 
the assumption of radiative equilibrium for the stars as a whole. 

13. Summary. The fundamental equation governing temperature 


changes in a compressible medium in motion, taking account of 
radiation, is obtained in the form of equation (9), section 3. It differs 
from the form given by previous investigators. The equation is 
applied to quasi-static motion, to steady motion along temperature 
gradients, to adiabatic motion, and to convective equilibrium. 

I should like to express my general indebtedness to Professor 


Rosseland’s memoir, already cited. 


* Cf. H. N. Russell, Astrophys. Journ., 54, 293 (1921). 











SOME PROBLEMS CONNECTED WITH FOURIER’S 
WORK ON TRANSCENDENTAL EQUATIONS 
By G. POLYA 
[Received 8 February 1930] 

THis paper is concerned with a very bold argument by which Fourier 
tried to prove that a certain integral function has only real zeros: 
Fourier applied a rule which was, then and now, proved only for 
polynomials, to a transcendental function. The question arises 
whether this application can be justified by some general theorem. 
This question was much discussed between Fourier and Poisson, 
mentioned by some contemporary writers, and quite forgotten after- 
wards. I resume the question in the present paper. I have not been 
able to solve it completely, though I have been led by it to some 
general theorems which may be interesting in themselves (section 3). 
All I can show is that we may draw from Fourier’s argument some 
definite conclusion, namely, that the function under consideration 
has either only real zeros or an infinity of imaginary zeros. 

1. Historical introduction. The integral function which Fourier 
tried to prove to have real zeros only is 


puts © sl J (2vx) (1) 


— | — 
2° 12,92 12, 92,32 °°" 


(in the notation current now). The algebraical rule to which he 
appealed is this: a polynomial f(x) with real coefficients has real roots 
only, if its derivatives f'(x), f(a), ..., f(a),... have the property : 
if € is real and f(é) = 0, then fe-%(E)foM(é) <0. (2) 
The rule is easily established now by the theorems of Budan- 
Fourier or Sturm; but its discovery was prior to that of these more 
general and better-known theorems, and was an important step to- 
wards them. It is due to de Gua.t 
The function (1) satisfies the differential equation 
F(x) +f'(@) +2" (x) = 0; (3) 
from which we deduce, by repeated differentiation, 
foN(x) + uf (x) tafe) = 0. (4) 
The series (1) shows that f(€) cannot vanish for € < 0. If >0and 
f(é) = 0, then f’-%(é) and f%*(€) have opposite signs on account 


+ Histoire de VAcadémie des Sciences, 1741, p. 459. 
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of (4); ie. (2) is true. (Two successive derivatives cannot vanish 

simultaneously, since then, by (4), all would vanish, and f(x) would 

vanish identically.) Fourier applies de Gua’s algebraical rule to f(x) 

without further inquiry, and concludes that f(x) has real zeros only. 
Cauchy expressed doubt about the validity of Fourier’s reasoning.{ 

Poisson went further, and refuted it by an example.§ The differential 


equation 


af(x)+bf'(x)+ef" (x) = 0 (5) 


may be treated like (3), if a, b, c are real and ac > 0. We have in fact 


af?-)(x)+b f(x) +c fe(x) = 0, 
and (2) follows. But the function 
f(x) = e*—e* = e*(1—e*) (6) 
is an integral of (5), with a= 2, b = —3, c= 1, and Fourier’s argu- 
ment shows that all the zeros of (6) are real, which is cbviously false. 

Poisson returned to the question on other occasions. One remark 
is worth quoting: ‘Ces régles [the reference is to de Gua’s] supposent 
qu’en différentiant un nombre de fois suffisant |’équation qu’on con- 
sidére, on parvient enfin & une autre équation dont on sait que toutes 
les racines sont réelles.’|| It is worth noticing that (as appears still 
more clearly from the context) Poisson is thinking here not only of 
polynomials but also of transcendental functions whose derivatives 
of high enough order have real zeros only ; he regards the application 
of the rule of de Gua as legitimate for such transcendental functions, 
and for such only.}+ 

Fourier defended himself brilliantly and energetically;t{ whether 

t Théorie de la chaleur, Paris 1822, article 308, or @uvres, edited by G. Dar- 
boux, Paris 1890, vol. 1, pp. 335-7. 

‘Quant a l’idée qu’a eue le méme géométre d’appliquer aux équations 
transcendantes les régles établies pour les équations algébriques, elle donne 
naissance & plusieurs difficultés’ (Hwvres complétes de Cauchy, 2me série, 
vol. 2, p. 400). 

§ Journal de l Ecole Polytechnique, vol. 12, 19. cahier, pp. 381-3. I present 
the example in the manner of G. Darboux (@uwres de Fourier, vol. 1, p. 336) 
and with the numerical specialization considered by Fourier (@uwvres, vol. 2, 
p- 188). 

|| Mémoires de ? Académie des Sciences, vol. 8, pp. 367-8. 

+t He changed his opinion afterwards concerning the applicability of the 
rule to (1). Compare Mémoires de l’ Académie des Sciences, vol. 8, pp. 367-8, 
and vol. 9, p. 95. 

{i @uvres, vol. 2, pp. 176-8, and 185-210. He objects to Poisson’s example 
(6) that it does not satisfy (2) for € = — (but, in proving the result for (1), 
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correctly or not, we cannot tell even now. But he shifted his position 
more than he seems to admit, and especially on two points. 

He gives a new proof} of the reality of all zeros of the function (1), 
which is entirely rigorous if we appeal to auxiliary lemmas which 
may be regarded as evident now (though hardly then). He proves 
first the reality of the zeros of a polynomial f,,(x) of degree n, using 
a differential equation like (3), and the rule of de Gua, then of course 
valid. He then proves that f,,(~) > f(x) when n ->co.t 

Further, Fourier gives conditions for the validity of de Gua’s rule. 
He also states a more general theorem which it is worth quoting 
verbally: § ‘Etant proposée une équation algébrique ou transcendante 
dx = 0 formée d’un nombre fini ou infini de facteurs réels ou ima- 


cinaires 
(1 -*), (1 *). (1 -*), (1-5)... 
x B Y 8 


... Péquation dz = 0 a précisément autant de racines imaginaires 
qu il y a de valeurs réelles de x qui, substituées dans une fonction 
dérivée intermédiaire d’un ordre quelconque, rendent cette fonction 
nulle, et donnent deux résultats de méme signe pour la fonction dé- 
rivée qui la précéde et pour celle qui la suit. Par conséquent si l’on 
parvient & prouver qu’il n’y aucune valeur réelle de x qui, en faisant 
évanouir une fonction dérivée intermédiaire, donne le méme signe 
& celle qui la précéde, et a celle qui la suit, on est assuré que la 


proposée ne peut avoir aucune racine imaginaire.’ 

No proof of this theorem (by Fourier or another mathematician) 
is known. Stern|| gave an invalid proof, and repeated in greater 
detail some affirmations of Fourier. Since then the question seems 


to have been neglected. 


he did not examine & oo, nor has the sign of the function for € = + 0 or 
that of its derivatives anything to do with the criterion if it is stated correctly). 

+ @Wuvres, vol. 2, pp. 200-4. ’ 

t After having completed his proof, Fourier tries to establish that the 
function (1) is a product of linear factors (@wvres, vol. 2, pp. 204-7). Laguerre 
(see his @uvres, Paris 1898, vol. 1, pp. 202-4), by combining a general theorem 
of his own with the differential equation (3), showed that the limiting process 
of Fourier establishes the decomposition into linear factors, i.e. that, in 
Laguerre’s terminology generally adopted to-day, (1) is of genus 0. Laguerre’s 
work reads quite like an analysis and completion of Fourier’s proof, but he 
does not quote Fourier. 

§ Fourier, Analyse des équations déterminées, Paris 1831, p. 66. 
Crelle’s Journal fiir die reine u. angew. Mathematik vol. 22 (1841), 
pp. 1-62. See in particular pp. 49-50. 
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2. Hypothetical theorems. Before going further I have to 
explain a terminology often used in the sequel. 

An integral function which takes real values on the real axis will 
be called a real integral function. 

Let g(z) be a real polynomial or a real integral function of genus 
0 or 1, and y a non-negative real constant. The integral function 

fle) =e g(2) 
will be called of genus 1*. Such a function is certainly real ; its genus 
may be 0 or 1 or 2, but in the last case it is only ‘slightly’ higher 
than genus 1. 

Let us suppose for a moment that neither the real integral function 
f(z) nor any of its derivatives f’(z), f’(z), ... possesses multiple zeros. 
Draw the curve whose equation in rectangular co-ordinates is 

y =f), 
and consider its positive minima and negative maxima, i.e. its 
extrema which are convex towards the x-axis; the abscissae of these 
points are called by Fourier the critical points of f(z). In other terms, 
é is a critical point of f(z), if 
FONE) = 0, fr-NE)FOHM(E) > 0. 

We have to consider n = 1, 2, 3, ..., and each abscissa is counted as 
many times as there are different graphs in which it arises. The 
definition is more elaborate if there are multiple zeros (see section 5). 

In this terminology an essential part of Fourier’s theorem quoted 
in section ] is embodied in the following proposition : 

A. A real integral function of genus 0 has just as many critical points 
as couples of imaginary zeros. 

This statement is narrower than that of Fourier’s ; it includes only 
the case where the product considered by Fourier is absolutely con- 


vergent, and does not apply, say, to sinz.f If it could be proved, 
Fourier’s argument about (1) would be completely justified, since 
this function is of order $ and thus certainly of genus zero, and fulfils 


the condition (2) of the rule of de Gua, i.e. has no critical points. 
Proposition A includes the assertion that the number of critical points 
and that of imaginary zeros are simultaneously infinite, and there lies 
the chief difference between it and the following proposition B. 

Algebraic rules about the reality of roots extend naturally and 

ft Stern applied Fourier’s argument to sin z (I.c., pp. 50-1). Poisson was, 
at a certain moment, inclined to believe that it applied to (1) but not to sin z; 
see Mémoires de l Académie, vol. 8 (1829), pp. 367-8. 
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often quite easily to integral functions of genus 1* having only a 
finite number of imaginary zeros, because these functions (and only 
these) may be approximated by real polynomials having a bounded 
number of non-real roots (see Lemma III in section 6). Thus we are 
led to the following proposition. 

B. If an integral function of genus 1* has only a finite number of 
imaginary zeros, it has just as many critical points as couples of 
imaginary zeros. 

There is no necessary contradiction between Poisson’s statement 
about the zeros of derivatives quoted in the preceding section, and 
the proposition B. If we admit that they are simultaneously true 
we are led to the following proposition. 

C. If an integral function f(z) of genus 1* has only a finite number 
of imaginary zeros, its derivatives from a certain one onward, let us say 
f(z), f(z), ..., have no imaginary zeros at all. 

3. Theorems to prove. I do not know whether any one of the 
theorems A, B, C is true or not. But some information about the 
question is contained in the following theorems I and IT, which I shall 
prove in the sequel. 

TueoreM I. Jf an integral function f(z) of genus 1* has only a finite 
number J of couples of imaginary zeros, then (as is well known) any 


derivative f™(z) of f(z) has also only a finite number J,, of couples of 
imaginary zeros, and 
> - > 
F = J; a J =— Js = **e «© 


Let K be the number of critical points of f(z) and 


J. = lim J,. 


n—>o 
Then J = K-+J,. (7) 

TueoreM II. If the order of a real integral function f(z) 1s less 
than 2 and f(z) has only a finite number of imaginary zeros, its deriva- 
tives from a certain one onward, let us say f™(z), f™**(z), ..., have 
no imaginary zeros at all.+ 


+ I stated this theorem without proof in 1927 (‘Ueber die algebraisch- 
funktionentheoretischen Untersuchungen von J. L. W. V. Jensen’, Matematisk- 
fysiske Meddelelser (of the Danish Academy), vol. 7, No. 17, p. 27, footnote). 
Since the present paper went to press, M. Alander published a proof and 
communicated a more general theorem stated by A. Wiman as probably true 
(Opuscula Mathematica A. Wiman dedicata, Lund 1930, pp. 79-98. p. 93). 
Alander’s proof is identical with my own which will be given in section 4, and 
the theorem suggested by Wiman is identical with the theorem C, suggested 
by Poisson’s remark. 
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it follows from (7) that 

K <J. (8) 
This inequality remains true for any function of genus 1* because it 
affirms nothing in the case J = 00, excluded in Theorem I. The case 
of inequality for (8) presents itself in Poisson’s example (6), where 
K=0 and J=oo. The function (6) is of genus and order 1, and 
thus it does not contradict the hypothetical theorem A, which affirms 
K = J for the genus 0 and so for any order less than 1. Theorem I 
shows further that the hypothetical theorems B and C are equivalent, 
since, by (7), J = K is equivalent to J,, = 0. 

Theorem II shows that C and consequently B becomes certainly 
true under narrower conditions, namely, if we replace ‘genus 1*’ by 
‘order less than 3’. 

Theorems I and II permit us to draw a definite conclusion from 
Fourier’s argument about (1). What Fourier actually proved is that 
K = 0 for the real integral function (1) whose order is } < 2. If (1) 
had only a finite number J of couples of imaginary zeros, we could 
apply first Theorem I, then Theorem IT, and obtain 

J=J,=9. 
That is, it follows from K=O that (1) has either an infinity of 
imaginary zeros or no such zeros. 

4. Proof of Theorem II. I begin with the proof of Theorem II, 
which requires less algebraic machinery than that of Theorem I. 

Let B+-ty, B—iy, where f and y are real, be a couple of conjugate 
imaginary zeros of the real integral function f(z). Consider the ellipse 
whose minor axis has B+-iy and B—‘y as endpoints and whose major 
axis has the length 2yvn ; it will be called a Jensen ellipse of order 
n of f(z). A Jensen ellipse of the first order is a circle, a Jensen 
circle of f(z). I have to use the following well-known lemma: ft 

Lemma I. Jf f(z) ts an integral function of genus 1*, the imaginary 
zeros of f(z) are contained in the Jensen circles of f(z) and, more 
generally, the imaginary zeros of f(z) in the Jensen ellipses of the n-th 
order of f(z). 

Let the function considered in Theorem II be 


F(z) = Cg +¢,2+-c,27+-...+6,2"+-.... (9) 
Its order is less than 2, and therefore its genus is 0, and because it 
3 ) 


7 J. L. W. V. Jensen, Acta Mathematica, vol. 36 (1913), pp. 181-95, see 
p. 190; J. L. Walsh, Annals of Mathematics, 2nd series, vol. 22 (1920), pp. 128- 
44; J.v.Sz. Nagy, Jahresber. d. Deutsch. Math. Ver., vol. 31 (1922), pp. 238-51. 
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is real it is certainly contained in the class of functions of genus 1* 
to which Lemma I applies. f(z) has only a finite number of imaginary 
zeros; thus its Jensen ellipses of order m cover only a finite region, 
interior to a circle 


lz] <b+cvm, (10) 


| 
where 6 and ¢ are independent of m; they are determined by the 
situation of the imaginary zeros 8+-‘y of f(z), b = Max|8|,c = Max|y}. 
The imaginary zeros of f(z), if any, are, by Lemma I, situated in 
the circle (10). 
On the other hand,} since the order of (9) is less than 3, there is 
a constant a, 
0<a< i, (11) 
such that lim c, n™/* = 0. (12) 


n 2) 


[t follows from (11), by a familiar remark,{ that‘there are arbitrarily 
large m such that 


|m™ ‘> ita (m+ ym -k)/ox 


i om 
for k eS 8 Therefore 
mma 1 


< < , 
(m-+-kymrhia * (m--kyklo 
f™(z))__ | QS OM+4)! Cmte x 
mic mik! c 


m m 


m-+-2)...(mM+K) Cm sel ix 
k! Cm i 


(m-+-k)* 1 Lath 


kt (m+k)hla™ 


2) 
|| Ky kk 1-1/ 
ad _—s > lo] —1/a 
> i = 2—exp(|z|m ). 


k=1 
The following argument and the result concerning (13) are due to 
M. Alander, ‘Sur le déplacement des zéros des fonctions entiéres par leur déri- 
I 


vation’ (Thése, Upsala, 1914, pp. 6-8). 
See e.g. G. Pélya and G. Szegé, Aufgaben und Lehrsdtze (Berlin 1925), 


vol. 1, p. 8, no. 108. 
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Therefore f(z) has all its zeros outside a circle, namely, in the region 


> m2 log 2. (13) 
Now, by (11), 


and thus, if m is sufficiently great, the regions (10) and (13) have no 
common point and f(z) has no imaginary zero. It follows, as is 
well known,7; that also f™*)(z), f™*(z), ... have only real zeros. 

5. Critical points. Suppose that f(x) takes real values and is a 
regular analytic function in some interval of the real axis. Suppose 
that € is a real zero of f’(x) of multiplicity m but not a zero of f(x). 
That is 

HE) AO, FE) =f(E) =... =fM(E) = 0, frrr(€) #0. (14) 
Put 


b. ; 
(i) k , if m is even, 


m-+1 
2 


(iia) k , if m is odd and f(&)f™*(é) > 0, 


_m 


—" if m is odd and f(g)fmé) < 0. 


- 


(iif) k 


If k > 0 we shall say that € is a critical zero of f'(x) of the (critical) 
multiplicity k. 

Observe that: 

(a) If m = 1, the point € is or is not a critical zero of f’(x) according 
as f(€)f"(€) is positive or negative, that is, according as the convex 
or the concave side of the curve y = f(x) is turned towards the x-axis 
at the point £, where f(x) reaches an extremum. If m= k= 1, the 
point € is simple (has the multiplicity 1) as a zero and also as a 
critical zero. A zero of f’(x) of multiplicity m and of critical multi- 
plicity may be interpreted as arising from m coincident simple real 
zeros, k of which are critical and m—k non-critical. 

(b) Critical zeros of f’(x) may arise also from coincident imaginary 
zeros. Assume that the analytic functions f(x), f,(x), ... take real 
values along an interval of the real axis, and tend towards f(z), 
uniformly in a rectangle of the complex plane that contains the 
interval in question in its interior. Consider the zero € of f’(x), of 


{ E. Laguerre, @uwvres, vol. 1, pp. 178-80. 
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multiplicity m and critical multiplicity , and a given small neigh- 
bourhood of € in the real points of which f(x)f*»(x) keeps a con- 
stant sign. For sufficiently great n, the sign of f,,(x) f0*(a) will be 
the same, and the neighbourhood will contain just m zeros of f), (x) 
among which 2j* may be imaginary and k* critical. If we examine 
the cases (i), (iia), (ii8) we see easily that in each of them 

2k = 2j*+-2k*. 

(c) If « passes through é, the sequence f(x), f’(x), f’(x), ... con- 
sidered in the Budan-Fourier rule,f loses 2k variations owing to the 
critical zero € of f’(x) of multiplicity k, i.e. it loses just 2k variations 
if there is no deriv wa e of f(x), beyond those mentioned in (14), that 
vanishes at the point €. These 2k variations are ‘wasted’ for f(z), 
i.e. they indicate no root of f(x), but they are not ‘wasted’ for f’(x) 
i.e. the shorter sequence f’(x), f’(x), ... loses just as many variations 
as there are roots of f(x) in the point €. 

A critical zero of f(x), n > 1, is related in just the same way to 
the curve y = f-(x) as a critical zero of f(x) to the curve y = f(x) 
All the critical zeros of f’(x), f’(x), f(x), ... are called critical anlnts 
of f(x). The multiplicity of a critical point of f(2) is the sum of the 
(critical) multiplicities of the critical zeros of f’(a), f’(x), ... lying in 
that point. If c is the multiplicity of a critical point € of f(x), then 
just 2c variations of the sequence f(x), f’(x), f’(x), ... are ‘wasted’ for 
f(x) when x passes through &. 

Consider an interval a, b. Suppose that, multiple zeros of any kind 
being counted with the proper multiplicity, 

N is the number of zeros of f(x) in a, b; 

N, is the number of zeros of f’(x) in a, b; 

K,, is the number of critical zeros of f(x) ina,b (v= 1, 2, 3,...). 

Suppose now that both at a and at b the function f(x) and its deri- 
vatives are different from zero, and further that a certain derivative 
f(a) is different from zero throughout a, b. Let {f(x)} be the number 
of variations in the sequence f(x), f’(x), f’(x), ... f(x), and {f’(x)} 
that in the shorter sequence f’(x), f"(x), ... f(x) 

Then, by (c) and the Budan-Fourier rule, 


N+2(K,+K,+...+Ki-1) = {f(@}—{f)}, (15) 
N,+2(K2+...+Ki4) = {f (@}—f'(O)}- (16) 


+ The Budan-Fourier rule in the generality needed here was discussed by 
A. Hurwitz, Math. Annalen, vol. 71 (1912), pp. 584-91. 
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Since £f(a)}—£f (a)} = 1 —sg fla) f(a), 
and there is an analogous relation for b, we infer from (15) and (16) 
that 
N+2K,—N, = 3{sg f(b) f'(b)—sg f(a) f(a). (17) 

Though the proof supposed more, (17) is valid on the sole hypothesis 
that f(a) f(b) f(a) f(b) 40. This is easy to confirm by direct geo- 
metrical considerations if we divide a, b into partial intervals by the 
zeros of f(x) and f’(x). The following special or rather limiting cases 
are particularly easy. 

(d) If the interval is closed, and both its endpoints are roots of 
f(x), then 


N,=N—1+42K,. (18) 
(e) If f(x) is a polynomial, the interval is open, and one of its 
extremities is a root of f(z) and the other +-00 or —oo, then 
N,=N+2K,,. (19) 
6. Integral functions of genus 1*. An integral function f(z) of 
genus 1* has the form 


2) = ae-v + amt (1 — 2 ) eee (1 —2 
f(z) = 2 ( =) n( =)( 


where : . : 
= | ad 
a, = B,+1y,, a, = B,—» 


a, B, Y; %, By, yy are real, «0, y> 0, a, #0, y,> 0, g is a non- 
negative integer, and each of the two products I] is either absolutely 
convergent, or has a finite number of factors, or has no factors at all 
(in which case it is to be replaced by 1). I prove here two lemmas, 
one on the critical points of integral functions of genus 1* and one 
on their approximation by polynomials. 

Lemma II. Jf f(z) is an integral function of genus 1*, the critical 
zeros of f'(z) are contained in the Jensen circles of f(z) and, more 
generally, the critical zeros of f(z) in the Jensen ellipses of order n 
of f(2). 

I will prove only the first part of the lemma, that about the Jensen 
circles. The second part, which by the way will not be used in the 
sequel, follows from the first and from Lemma I by an easy geo- 
metrical argument} also used in the proof of Lemma I. 

t See J. v. Sz. Nagy, Jahresber. d. Deutsch. Math. Ver., vol. 31 (1922), 
pp. 243-4. 
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Differentiating log f(z) twice we obtain 


f’(zP?—fef"@ _ 


¢ | z—Bf,)°—y,* 
a P 214, 2]2° 

i Bed TY, ] 
Exclude the case (in which there is nothing to prove) that (21) 
vanishes identically, or what is the same, that f(z) is of the form 
veb?, If € is a real zero of f’(x) but not a zero of f(x), and € is exterior 


to all the Jensen circles, that is if 


(€— B,)* = y,*, 


(21) 


then we obtain from (21) 

—flE)f'"(€) > 0. 
Thus € is a simple and non-critical zero of f’(z), which is the relevant 
part of Lemma IT. 

Lemma IT expresses a relation between the situation of the ima- 
ginary zeros and that of the critical points of an integral function of 
genus 1*. It is a fortiori valid for polynomials; and this special case 
may be part of an answer to a rather obscure question of Gauss. 
The geometrical relation given by Lemma II may be expressed in 
a different form: 

Let f(z) be an integral function of genus 1* and & a critical zero of 
f'(z). Let two straight lines passing through & divide the complex plane 
into four right angles two of which are bisected by the real axis. Then 
the two right angles which are not bisected by the real axis contain each 
at least one complex zero of f(z). If € were a critical zero of f(z), 
instead of being one of f'(z), then, in order to obtain the same con- 
clusion, we must open the two angles just considered till the aperture 
of each becomes 2 arctan Vn. 

Lemma III. If the integral function f(z) of genus 1* has only a finite 
number of imaginary zeros, all situated in a certain closed domain 
R of the complex plane, then there is a sequence of real polynomials 
tending to f(z), uniformly in R, and such that each polynomial of 
the sequence has just the same zeros in R as f(z) and only real zeros 
outside R. 

Werke, vol. 3, p. 120. The question is whether there is a ‘definite’ one- 
to-one relation between critical points and couples of imaginary zeros: ‘ob 
jeder bestimmten Liicke ein bestimmtes Paar imaginéirer Wurzeln angehért.’ 
Did Gauss mean a geometrical relation ? 
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If f(z) has the form (20), polynomials of the required kind are given 


S (nl 1 at 2 
+ 2, (a; +a, ) , 


by the formula 


P 


x n, (1-2) m (1-2) (1-2); 
Oty, a,/\ a, 


here p runs through all integers from a certain one onward, the pro- 
duct IT, and the sum &, are extended over the a, whose modulus 
does not exceed p, and P= P(p) is a suitably chosen (large) 
integer.f 

7. Proof of Theorem I. The main point of Theorem I is con- 
tained in the following: 

Lemma IV. If f(z) is an integral function of genus 1* and has 2. 
imaginary zeros, and its derivative f'(z) has 2J, imaginary and K, 
critical zeros, then 

J=J,+K;. 

Relation (22) is easy to prove for real polynomials, and the fol- 
lowing proof extends it with the help of approximating polynomials 
(Lemma IIT) to functions of genus 1* ; this process succeeds because 
the imaginary and critical zeros of the derivative cannot ‘escape to 


infinity’ (Lemmas I and II). But the details need very careful 


handling. 

Consider a rectangle R with horizontal and vertical sides, sym- 
metrical with respect to the real axis. Choose FR so that 

(i) R contains in its interior all the 2J couples of imaginary roots 
of f(z) with the corresponding Jensen circles, and both its vertical 
sides are halved by a root of f(z). These roots are real and will be 
called a and b. 

The choice of R to satisfy the second condition in (i) is certainly 
possible if f(z) possesses an unlimited number of positive and negative 
zeros. I assume this in the following proof, which requires a slight 
modification—use of the more complicated (17) instead of (18) and 
(19)—in the other cases where f(z) possesses only a finite number of 
zeros of one sign or altogether. 

It follows from (i), by Lemmas I and II, that the 2J, imaginary 
and the K, critical zeros of f’(z) are contained in the interior of R. 

+ Compare Hilfssatz ii, p. 20, (Matematisk-fysiske Meddelelser, vol. 7, 
No. 17). Note that ‘erhéhtes Genus 1’ and ‘genus 1*’ mean the same thing. 
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(ii) Let f,,(z) be a real polynomial of degree n which has just the 
same zeros in the closed rectangle R as f(z), and no imaginary zeros 
outside R. 
Let f(z), and therefore f,,(z), have N real zeros in the closed rect- 
angle R. If the number of zeros of f,,(z) outside R (all of which are 
real) is n’, then 


n=2J+N-+7’. (23) 


(iii) Let the zeros of f’(z) inside R be the centres of small circles 
all interior to R and exterior to each other. Choose f,,(z), according 
to Lemma ITI, so that f;(z) has in each of these small circles exactly 
the same number of roots as f’(z) and no other roots inside or on the 
boundary of R, except possibly roots in a and b if these points are 
multiple zeros of f(z). This choice is possible, by Lemma III and 
well-known general theorems. 

By (i) and (ii), f(z) and f,(z) have the same imaginary roots and 
Jensen circles, all inside R. Therefore, by Lemmas I and II, all 
imaginary and critical zeros of f’(z) and f;(z) are contained in the 
interior of R. Let f'(z) have 2J¥ imaginary, Kf critical, and Nf 
real zeros in the closed rectangle R, the corresponding numbers for 
f'(z) being 2J,, K,, and Nj. 

Since f’ (z) has no critical zeros outside R, it has, by (e) of section 5, 


just the same number of real zeros outside R as f,(z). Therefore, 


compare (23), 
n—1 = 2J¢+NT+7n’. 
By (d) of section 5 we have 
N}¥ = N—1+2Kf. 
By (iii) and (6) of section 5, we have 
2J*+2K* = 2J,+2K,. 
Combining (23) and (24) we obtain 
Nf = N—1+2J—2Jf. 
This gives, compared with (25), 
J = Jt+ KT =J,4+ Ky, 
by (26). Thus (22) is proved. 
Suppose that f(z) has 2J, imaginary and K, critical zeros, 
, 2, 3, ...). Since f(z) is of genus 1* and has only a finite num- 
D 
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ber of imaginary zeros, f’(z), f’(z), ... are alsof of genus 1*. Thus we 
have a system of equations like (22), viz. 


J,=J5,+-K,, Jg=J534+ Kz, -.:; 
and thus J=J,+K,+K,4+...+K 
J =J,.+K,+ K,- 


n? 


which is Theorem I. 


+ This is an easy development of a theorem of Laguerre (Huwvres, vol. 1, 
pp. 178-80). See e.g. G. Pélya, Crelle’s Journal fiir die reine u. ang. Math., 
vol. 145 (1914), pp. 224-9, p. 245, Theorem II, from which it follows at once. 











ELECTRICAL NOTES 
By F. B. PIDDUCK 
[Received 21 February 1930] 


|. Relation of two-dimensional problems in electrostatics 
and electromagnetism 
THe evident fact that the electric potential V of an infinite line 
charge and the magnetic potential 2 of an infinite straight current 
are conjugate functions can be generalized to dielectrics and magnetic 
media. We shall pass briefly over the direct proof since the theorem 
is easy to find. 

Suppose for simplicity of enunciation that an infinite cylinder S of 
any cross-section separates in one problem two dielectrics of dielectric 
constants K, and K,, and in the other two magnetic media of per- 
meabilities », and ps. Let there be charges of e, and e, electrostatic 
units per unit length in two lines parallel to the generators in the 
first and second dielectrics respectively, and currents 7, and 7, in 
the same lines in the second problem. Let V be the electric potential 
in the first problem and U its conjugate function such that 

V+tiU = f(x+iy). 
The conditions of each problem are five: Laplace’s equation in two 
dimensions, two conditions expressing the behaviour of the functions 
infinitely near the two lines, and two boundary conditions on S. The 
relations @V/én = 0U/és, @V/és = —é@U/én are just such as will 
transform a tangential boundary condition into a normal one, and 
conversely, and a simple calculation shows that all the conditions 
are satisfied if 2 =A,U in the first space and A,U in the second 


space, where 


Ky fy = Ky pg, Ay Hy = Ap He: €1/ty = g/t. 
The lines of magnetic force in the second problem coincide with the 
equipotential curves in the first, and if one problem is solved by 
images the other has the same image system.* 
The constancy of Ky suggests a more comprehensive view. The 
conjugate function relation is familiar in Rayleigh’s theory of electric 
waves on cylindrical conductors.t| Can we generalize it to cover a 


* Cf. G. F. C. Searle, Hlectrician, 28 Jan. 1898. 
Rayleigh, Phil. Mag. [4], 44, p. 199 (1897); Scientific Papers, 4, p. 327. 
D2 
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dielectric constant which varies from point to point of the cross- 
section ? Obviously only if the permeability varies inversely so as 
to produce a wave velocity the same at all points, so that the wave 
fronts remain plane. Consider the electric waves on a number of 
conducting cylinders parallel to the axis of z, the dielectric constant 
K and permeability » being functions of x, y, subject to Kp having 
the constant value c?/v?. Then the equations 

KéE/ét = ceurlH, div KE = 0, —peH/ét = ccurl E, div »H = 0 


admit the solution 


c OV Kvav 
pe dye by” 
where V is a function of x, y and z—vt satisfying the equation 


a(kz)+a(*G)=° 
Ox Ox oy oy 


Evidently a solution of the electrostatic problem suffices to solve the 
wave problem, and in particular the problem of the field of high- 
frequency currents along the cylinders considered in the electrostatic 
problem. If we make the cylinders infinitely small in cross-section, 


H.= on Sig ee H.=0, 
pv Ox Cc Ox : 


x 


as well as the wave-length infinitely long, we have the analogy with 
which we began. 


II. The electric oscillations on a thin terminated wire. 


The interest of Abraham’s theory* of the electric oscillations on 
a prolate spheroid being in its application to a thin rod, one could 
wish it brought into agreement with Pocklington’s theoryy of the 
oscillations on a thin wire of any form. Rayleight remarked that 
the limit of a spheroid is not a uniform rod but one which tapers 
according to a definite law ; but gave reasons, from the solution of 
another problem, for supposing that the magnitude of the cross- 
section is immaterial provided it is small. The difficulty itself is, 
however, only apparent in our problem. All we require are solutions 
which make the electric force intersect the wire at right angles in 
the limit, and this is provided by Abraham’s solutions, or by the 
* M. Abraham, Wied. Ann. 66, 435 (1898). 
+ H.C. Pocklington, Proc. Camb. Phil. Soc. 9, 329 (1897). 
t Phil. Mag. [6] 8, 105 (1904); Scientific Papers, 5, p. 198. 
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oscillations on any other surface which can be made to tend to a thin 
terminated wire. 

Pocklington’s main result for a thin wire is that the distribution 
of current is simple harmonic in the arc s, provided that the wire is 
either closed or has its extremities at infinity. The restriction is 
unnecessary. Pocklington carried out an integration by parts in 
which he replaced —f A dII by [II dA, the distribution of Hertzian 
oscillators along the wire being proportional to a function A, simple 
harmonic in ¢. We may remark parenthetically that the sign of 
Pocklington’s coefficient « should be changed in order to make it 
clear that we consider only waves propagated outwards from the 
elementary oscillators. Larmor* observed that, since the current in 
the wire is proportional to @A/ét, A vanishes at a free end. Thus 
Pocklington’s conclusions apply equally to a terminated wire. Since 
Abraham found that the distribution of current is simple harmonic 
in s for the limit of a thin spheroid, there is complete accord, and 
both make the fundamental wave-length twice the length of the rod. 
The result is now seen to hold for a rod bent into any curve. 


* J. Larmor, Nature, 67, 361 (1903). 








THE ZEROS OF CERTAIN INTEGRAL FUNCTIONS 
By MARY L. CARTWRIGHT 
[Received 15 February 1930] 
Introduction. 
1.1. I propose to consider functions of the form 
b 
f(z) =f(at+iy) = f(re”) = | ed(t) dt, (1.11) 
where ¢(¢) is a complex function, integrable in the sense of Lebesgue. 
The functions 


b b 
U(z) | cos zt u(t) dt, V(z)- | sin zt v(t) dt (1.1: 


a a 
may be reduced to the form (1.11) by simple transformations. Among 
them are many weli-known functions such as Bessel functions. 

The problem here is to determine approximately the total number 
of zeros, and the position of the zeros for which r is large. Titch- 
marsh* has proved a number of very general theorems of this 
character. In these it is supposed (as is obviously permissible without 
real loss of generality) that a and 6 are the effective limits of integra- 


tion of ¢(t), that is to say that 
a+d b 
| d(t)| dt > 0, | $(t)| dt > 0, (1.13) 
b-8 


a 
for every 5 > 0. It is also supposed that f(0) 40. This hypothesis 
also may be made without loss of generality, since f(0) = 0 involves 


b 
fe) = al ety (t) dt = —2g(2), 


ft 
where x(t) | (ude and g(z) is a function of the same type as 


f(z) with a zero at the origin of smaller order. 
Titchmarsh’s chief results are as follows: 
TurorEM I. Jf z,=1r,e%, z.=1, , where 0 <7, <1, <...; 
are the zeros of f(z), then the series 
2 “ 
7 “ 6, 
Tn 
; 1 
is absolutely convergent. 


* EK. C. Titchmarsh, Proc. London Math. Soc. (2) 25 (1926), 283-302. 
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THEoreM II. Jf n(p) is the number of zeros of f(z) for which r <p, 


then a 
np) ~ — 3 
when p > ©. 


THEOREM III. The series 


2) 
Nisin 4§n 


aa Tn 
1 
is conditionally convergent, and we can write 


f(z) = f(O)etarve I1(1-=) | 


1 z 


the product being conditionally convergent. 


1.2. Pélya* has proved a number of theorems of a different type 
concerning the zeros of the functions (1.11). In Pélya’s theorems 
the hypotheses are much more restrictive, and the conclusions much 
more precise. For example, he proves 

TuHeorEeM IV. Jf d(t) is positive and non-decreasing, then all the 
zeros of f(z) lie in the closed half-plane x <0. 


THeorEM V. If d(t) is continuous, positive and differentiable, except 
at a finite number of points, and if 


—#(t)_ 
a< $(0) <B (a<t<b), 
then all the zeros of f(z) lie in the strip «<a <B. 

He also proves very precise results concerning the reality and 
location of the zeros of the functions (1.12); thus if v(t) is a positive 
increasing function, which is convex in the interval (0,1), and if 
v(t+-0) = 0, then all the zeros of 


) 


1 
V (z) =| sin 2t v(t) dt 
( 


are real, and there is exactly one zero inside each of the intervals 
(or, 37) (27, 8r).... The well-known results for Bessel functions are 


included among these as special cases. 


* G. Pélya, Math. Zeitschrift, 2 (1918), 352-83. 
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Very precise results have been obtained by Hardy,* for particular 
functions, such as 


| . i\31. T(s)z" 
i ee ~ = 
fey—[ ee (e6;)" = > race era 


0 


The first new theorem of this paper, viz. VI, and the substance of 
VII are also due to Professor Hardy. I should like to thank him for 
permission to use his manuscript, and for many valuable suggestions 
and criticisms. 

1.3. My object here is to prove a number of theorems intermediate 
in generality between those of Titchmarsh and Hardy. These 
theorems are of a rather different character from most of Podlya’s, 
more general in some respects and less in others. I begin with a few 
general remarks by way of explanation. 

When zoo or —oo by real values, the behaviour of f(z) is 
dominated by the behaviour of ¢4(¢) near 6 or a, and this remains 
true, roughly, when z + 00 in the part of the plane well removed from 
the imaginary axis. It is essential, in order to obtain any lower limit 
for n(p), to make some hypothesis which ensures that ¢(¢) shall not 
be ‘too small’ near a or 6. In Titchmarsh’s work, this hypothesis is 


(1.13). For my purposes a stronger hypothesis is required, and I 


suppose throughout the paper that ¢(¢) is continuous at a and b, 
and that neither 4(a) nor ¢(b) is 0. This hypothesis ensures that the 
regions s 

8 |cos 0| >« > 0, 
at least, are free from zeros. 

Near the imaginary axis f(z) behaves very much like the Fourier 
coefficient of d(t). It follows from what we have just said that the 
zeros must lie ‘near the imaginary axis’, so that it is not surprising 
to find that there is an analogy between the order of the Fourier 
coefficients of 4(¢) and the distribution of the zeros of f(z). The zeros 
are distributed more or less like those of the standard function, 

i. a 

. sinh | }(b—a) z—4} log * ) \. 

is is $(b)/) 
the error in (p), in nearly all the cases considered, is p times the order 
of the [p]th Fourier coefficient. This rule also holds in the general 
case considered by Titchmarsh, since, by Theorem II, the error for 


* G. H. Hardy, Proc. London Math. Soc. (2) (1904), 401-31. 
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n(p) is o(p), while the [p]th Fourier coefficient of 4(¢) is o(1); but I shall 
prove that there are exceptions. 

There are, naturally, corresponding results if ¢(¢) tends to 0 or to 
20 at the ends in a fairly regular manner, but these I must leave for 
another time. 

1.4. It is convenient to take —1 and 1 as the limits of integration. 
The general case can be reduced to this by an obvious linear trans- 
formation. 

[ shall also suppose that ¢(1) = ¢(—1) = 1. The case in which ¢(¢) 
does not vanish at the ends can be reduced to this. For putting 


| _ (6—I)\* gO _ 
z= atblog’ gay > Y= 1 gay J ad(—D} 


and making the functions one-valued by convention, we have 


fe) = oorg(—D¥ |" 


eld, (t) dt = {4(1)¢(—1)}49(z,) 
1 


where ¢,(1) = d,(—1) = 1. 
Throughout much of the working we have to consider 
2f(z) = f,(z) = Re®®, 
where R= R(z), @=O@(z) are real, but n(p) always denotes the 
number of zeros of f(z) for which r <p, except in 3.21 and 3.22, 
where it obviously refers to the zeros of F(z). 


$(t) an integral. 
2.1. Hardy has proved 
: t 
THEOREM VI. Jf d(t) is an integral, 1.e. of (t) -| ¢'(u) du, then 


fo 
the zeros of f(z) are determined asymptotically by the formula 


z= +mri+e,, 
where m is a positive integer and «,, > 0. 
We shall require the following lemma, which has been proved by 
Titchmarsh.* 
Lemma A. If x(t) is integrable, then, 


1 
| ep(t) a= ofericos ay 
-1 


uniformly with respect to 0. 
* Loc. cit. 288. 
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Integrating by parts, 


fey=*— 3 etd’(t) dt. 


Since ¢’(¢) is integrable, 
file) = €—e-*—oferem 

= e*(1+<«,)—e*(1+,), 
where «, denotes generally a function of r and @ which tends to 0 
uniformly in #6, when roo. At a zero of f(z) 

e?(1-+-e,) = e-*(1+-e,), 
e — ]+e,; 

and hence 2z = +2mm7i+e,,, 
where m is a positive integer and «,, > 0. 

It remains to prove that one and only one zero of f(z) is associated 
with each large integral value of m. The function sinhz has exactly 
one zero at each point z= +m7i. Let 5 be any sufficiently small 
positive number, and let us surround each of these points by a circle 
of radius 6. Then 

\sinhz| > (8) > 0 
on the circle. Also the ratio between the moduli of the two terms 
of sinhz lies between certain fixed limits, depending only on 6, on 
the circle. Hence, on the circle, we may write 
fi(z) = sinhz (1+e,,), 
where «,,>0 as moo. It follows that f,(z) has, for large m, the 
same number of zeros inside the circle as sinhz, that is to say, one. 

2.2. We have thus determined the large zeros of f(z) with con- 

siderable precision. It is easy to show also that the total number of 


zeros is the same for f,(z) and sinhz, i.e. that 


when p is large and p/z is not too near an integer. 


$(t) of bounded variation. 
3.11. We know that, if a,, is the mth Fourier coefficient of 4(¢), then 
a,, = 0(1) for an integrable ¢, 
a,, = 0(m-1) when ¢ is an integral. 


in either case the error in n(p) is p times the order of Ay). 
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If 4 is of bounded variation, a,,— O(m-1), and if ¢ belongs to 
Lip «, i.e. if b(t-+h)—(t) = O(\h|%), 
then a,, = O(m-*). Analogy suggests that the corresponding errors 
in n(p) should be O(1) and O(p!-*). This is in fact true, but the 
proofs are decidedly more difficult. 

Since a function of bounded variation is the difference of two 
monotonic functions, it is natural to expect also an analogue of 
Theorem IV. This is Theorem VII below (an essential preliminary 
to Theorem VIII, which is the theorem for n(p) when ¢ is of bounded 
variation). 

TueoreM VII. Jf d(t) is of bounded variation, then all the zeros of 
f(z) lie in the strip \x| <K for some K. 

TuroreM VIII. Jf d(t) is of bounded variation, then, as p > 2, 


n(p) = 2 + (1). 


The following example shows that there is no one-to-one corre- 
spondence between the zeros of f(z) and sinh z as in the ‘integral’ case. 
Suppose that, f(z) not having been reduced to the standard case, 

p(t) = —1 (—1<t<0) 
(0<t< 1). 
_ 4sinh*}z 


Then the zeros of f(z) = 


ae 
~~ 


occur in pairs at the points z= + 2mmi, m= 1, 2,..., instead of 
singly at or near the points z= +(m—4)mt, as they would if the 


result of Theorem VIII still held. 
3.12. Proof of Theorem VII. Suppose first that x > 0. Integrating 


e?—e-* ] 1 5 
| e dd 
“J-1 


by parts 


~ 
~ 


1-§ 1f) 
| et ig—2| e# dd. 
1 231-8 


3 | 1-8 
| et d¢| F era \dd| we He-8), 


=} 1 


| et dd|<et Max | (t,)—(t,)| 
1-3 1 


3 1-3 <4 <k< 
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where «(5) > 0 as6—>0. Hence 

fi(z) = &+e%(8) + Ofert-9}. (3.121) 
By choosing 8 first sufficiently small, and then 2 sufficiently large, 
we can make 

le(3)-+O(e-82)| <4, 

and so R(z) > $e” 
for x > K. In particular f(z) has no zeros for these values of x. The 
zeros with large negative real part may be disposed of similarly. 

It is plain that we have proved incidentally that f(x) is bounded 
in any strip i which |x| is bounded. This can also be proved by the 
second mean-value theorem. 

3.21. The proof of Theorem VIII depends essentially on the fol- 
lowing lemma. 

Lemma B. Suppose that F(z) is regular for r <r’, F(0)=1,k >1, 
and that M(r’) is the maximum of |F(z)| on the circle r=r'. Then 
there is a number H = H(k) such that 


|e 
| d{arg F(r’e’)}| < Hlog M(r’), (3.211) 


% 
for |O@—05| < 22 and r” <1’ Jk. 
It must be observed that if 6 = 0,+ 27 the result is comparatively: 
trivial. For we can show by Jensen’s* formula that 


nr") < log me) _ log M(r’) 
log (r’/r”) log k 
and therefore 
2a log M(r’) 


log k 


0, +24 
| : "dfarg F(r’e*9)\ — 2nn(r") < 
6 

(If a zero of F(z) lies on the circle r=p, I suppose that the contour 
is indented so as to include the zero.) 

That the case in which |9—0,| < 27 should be less simple is natural. 
Suppose that F(z) is an integral function. Then if we integrate round 
a closed contour, the arguments of the factors belonging to the zeros 
outside the contour, and of all the exponential factors in the canonical 
product for F(z), return to their initial values. When we integrate 
round part of a contour only, these factors have to be considered. 


* See (3.221). 
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The proof of Lemma B is based on the proof of the following result 
due to Valiron.* 

Lemma C. Suppose that the hypotheses of Lemma B are satisfied. 
Then there is a number H = H(k) such that 

log | F(r’e’®)| > —H log M(r’) 

for some r” such that r'/k? <r" <1’ /k. 

The proofs of both lemmas require Caratheodory’s} well-known 
inequality, Jensen’s formula, and the following lemma due to 


Mr-a)--( 


where \z,| <r’, and thatk>1. Then 
|P(2)| > {(k—1)k-*(20)-#*"}" 
on some circles in each of the annular regions 
rilkcr<r', r/RP<or<r' /k. 
3.22. Proof of Lemma B. We can write 
F(z) = P(zje*™, 
where P(z) is a oma formed with the zeros of F(z) inside the 


Boutroux.t 
Lemma D. Suppose that 


circle r=r'/k*, P(O)=1, @(z) is regular inside this circle, and 
(0) = 0. It follows from Jensen’s formula that 


N(r’) = ~ du = |, 18 | F(r’e*®)| dé < log M(r’). 
7 Jo 


2 


o U 


Since n(u) is a non-decreasing function, 


logk _ n(u) — 
~n oe d <N > 
"(a 3 <n Sais ie 


r’\ _ 3log M(r’) 
SO that (ia) S log k ° 


Hence by Lemma D, there is a circle r=r’” such that 
k-tr’ <r" < k-t’ 
on which | P(z)| > {M(r’)}-®". 
* See G. Valiron, Lectures on the General Theory of Integral Functions, 
Toulouse (1923), 89. See also G. Pélya, Journal London Math. Soc. 1 (1926), 
78 footnote. 


+ See G. Valiron, loc. cit., 20. 
t P. Boutroux, Acta Math. 28 (1903), 97-226 ; see also G. Valiron, loc. cit. 80. 
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F(z) 


nce 2)} — gh , +H? 
Hence jee) — >(2)| ~ {M(r’)} 


for r= ¢” 
Let p(r), B(r), and A(r) be the maxima of |®(z)|, |3{@(z)}|, and 
N{O(z)} for |@—O,| < 2a. By Caratheodory’s inequality 
Bir) < wlr) < 37 — A(l-w’), 
for r <k-ir’. But it follows from (3.222) that 
A(k-tr’) < A(r’”) < (1+H’) log M(r’), 
4r 


and so B(r) <(1+- A’) —— 
k-tr’ — 


log M(r’), 
ma 
for r< k-ir'’. 
Now we can express arg F(z) in the form 
arg F(z) = arg P(z) + 3{P(z)}. 
It follows that 


r@ 6 6 
| d{arg F(r"e'®) | d{arg Pirre)+-| dAX{P(r"e"*)}. (3.223) 
% 9 % 


We have seen that the number of factors in P(z) is less than 
3!og M(r’)/logk. Hence 


| (8 
| d{arg P(r"e'®)} < 


67 log M(r’) 


; ee (3.224) 
0 >” 


for |@—0,| <2. Also, if r”<71'/k, 


|S{@(r"e)}| < B(r”) < (;) : eof i Nog M(r') = 
4(1+-H’) 

= ] 
y-1 


Therefore 


6 > \ , 
| dX O(r"e')}) sr og Mir’). (3.225) 
4 Gi 


Combining (3.224) and (3.225) with (3.223), we obtain (3.211). 
3.23. Proof of Theorem VIII. We saw in 3.12 (i) that f,(z) is 
bounded in any strip in which |x| is bounded, and (ii) that there is 


a K such that R(z) > he 


for « > K. We can therefore find an H, = H,(K)* such that 
R(z) < hy, (3.231) 


* In what follows H always denotes a number depending on K and on 
previous H’s, and therefore ultimately on K only. 
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for |x| <10K, and an H, = H,(K) such that 
R(z) > He (3.232) 
for x= K. 
We denote the line x = K by L, the circle r = p by I’, and their 
points of intersection by a, and d,,a, being above the real axis. We 


denote the circles |z—a,| = 2K, |z—d,| = 2K by c and € respectively. 











Finally, we suppose that c cuts T in pe“: and pe’, and that é cuts 
I in pe“, pe@7-9é (the first of each pair of points being that whose 
real part is positive). Now let 


fil@) _ fl *") _ pry), (3.233) 
fi(@,) fi(@,) 
and apply Lemma B to F(u) with r’ = 8K, k= 2, and r”= 2K. It 
follows from the lemma, and (3.231) and (3.232), that 


|f d®| < Hyg, (3.234) 
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where H, = H,(H,, H,) = H,(K), and the integral is taken round any 
are of c. There is obviously a similar inequality for the integral taken 
round any arc of ¢. 

Let C, and C, be the contours formed by (i) the arcs of I’ from 
pe“: to pe“”:, and from pe": to pe®-%, (ii) the connecting arcs of 
c and é, C, being completed by the ares of c and ¢ remote from the 
origin, and C, by those nearer the origin. If C is C, or C3, and 
n(C) the number of zeros of f,(z) inside or on the boundary of C, then 

] 
n(C) = —. | de = 1,41, (3.235) 
2mt No 
where J, is the part of the integral corresponding to the ares (i), and 
I, that corresponding to the arcs (ii). It follows from (3.234) that 
|I,| < Hy. (3.236) 

The integral J consists of two parts, J; and J}, the first lying in 
the half-plane z > K and the second in the half-plane x < —K. In 
I, we have by (3.121) and (3.122) 

f,(2) = e+ e%e(5)+ Ofert—d) 
= e(1+40), (3.237) 
where |f| < }. It follows that 


1, =£sin6,+0(1) =£-;-O(1), 
=> 7 


‘ 


since 0, differs from 47 by O(p-*). Similarly 


I; =£+0(1), 


2p 
and so =——+ O(1). 
7 


Combining (3.238) with (3.235) and (3.236) we see that 


Since this is true whether C is C, or C,, and n(p)+1 lies between 
n(C,) and n(C,), we deduce the required result. 

3.3. We may replace C, and C, by the lines x = +X and the ares 
of c and é joining them, since the new contours enclose the same 
numbers of zeros. We can then connect the lines «= +K by the 
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ares of |z—K| = 2K on either side of the origin. Hence we can show 
that the number of zeros for which r < p, y > 0, is 

P4+O(1). (3.31) 
7 
if b(t) is real, the zeros are pairs of conjugate complex numbers, 
and (3.31) is a trivial deduction from Theorem VIII. Also Theorem 
VIII can be proved without using Lemma B. We proceed as follows : * 
Replace the contours C’, and C, by the rectangles « = +K, y= +p 
and a= +K, y= +(p—K’), where K’ >p—p?—K*. We denote 
the new contours by C, and Cj respectiv a. Then we still have 
n(C3) <n(p)+1 < n(C}), 
nd it is only necessary to consider { d® taken along the lines 
y=+p, y=+(p—K’), 
since the integral ri along x = +K can be evaluated as before. 
Suppose that t{f,(z)} vanishes g times between K +-ip and —K +t; 
then the interval can a divided into g+1 parts, throughout each of 
which ‘{f,(z)} is of constant sign. Hence in each the variation of 
©@(z) is less than z, so that its total variation is less than (¢+1)z. 
But on y= 0 
Nifile+tp)} = Hfilz+tp)+file—tp)} = Fe). 
Therefore g is less than or equal to the number of zeros of F(z) inside 
any circle enclosing the interval —K <a < K of the real axis. Hence 
applying Jensen’s formula as in (3.221) we can show that 
fd0| <(q+1)z <4,, 
where the integral is taken along y=p from K-+ip to —K-+ ip. 
Similar results can obviously be obtained for y = —p, y= +(p—K’). 
3.41. The result of Theorem VIII shows that 
N(p) = \ ~~ du = . O(log p). 
» « 7 
[ shall, however, prove a much more precise result. 
Turorem IX. If d(t) is of bounded variation, then 


N(p) . =P log p —log f(0)|+0(1) (3.411) 


77 


* Compare R. J. Backlund, Acta Math. 41; also J. D. Tamarkin, Journal 
London Math. Soc. 2 (1927), 66-9; and C. T. Wilder, Trans. Amer. Math. Soc. 


18 (1917), 415-42. 
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This shows that ‘on the average’ f(z) has the same number of zeros 
as (sinh z)/z, for which 


N(p) - a asi log p + (5) 


7 


i 


The proof is based upon Jensen’s formula 


- 9 
Th <7 Jo 


mm. | “tog f(pe®)| dd —log |f(0)|, (3.412) 
WHET 7), Te, «++5 r, are the moduli of the zeros for which r < p. It 
is easy to evaluate the integral in (3.412) along any are for which 
«| > K, but we require some additional machinery before we can 
deal with the strip containing the zeros. 
3.42. Proof. By (3.121), given any » > 0, we can choose K so that 
file) = e(1+9), (3.421) 
where |¢| <1 for a >K. As before, let a, and @, be the points of 
intersection of r= p and «= K. We have again 
R(z) < H,(K) = H,(), (3.422) 
for |x| < 10K, and 
R(z) > HK, n) = Hn) (3.4221) 
K. Hence we may apply Lemmas B and C with 7’ = 8K, 
= 2, r”=y to the function F(u) defined as in (3.233), and we have 
[ dO| - H,(n), (3.423) 
and 
R(z) > —H,(7), (3.424) 
on some circle z—a,|=y, where 2K < y< 4K. There are obviously 
similar inequalities for some circle |z—d,| = y’, where 2K < y’ < 4K. 
We now denote the circles |z—a ,|= y, and |z—d,| = y’ by c and @, 
and denote their points of intersection with by pe, pe’, pe-#%, 
pe- ‘8s 


We construct C from I and the new c and € as in 3.2. We also 


construct a loop round each zero inside C, each loop consisting of 
a small circle of radius 6 round the zero and two lines joining it to’ 
some point on one or other arc of [.. The loops must be arranged 
so that they do not intersect, and so that p/7+O(1)* of them are 
joined to the arc in the right half-plane, and the rest to the are in 

* Theorem II shows that there are more than enough loops to do this, or 


we may consider the unsymmetrical function e-* f(z), and all the loops can be 
joined to the arc in the left half-plane. 
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the left half-plane. Finally, from C together with the loops we 
construct a closed contour D(8) inside which f(z) has no zeros. It 
follows that log f(z) is regular inside D(5), and we can apply Cauchy’s 
theorem. 

We make 6 + 0, and take the real part. We write 


n’ 
N(C) = log P 
"Olle. Tu 


vu 


> 


where r,, 72, ... 7,, are the moduli of the zeros inside or on the 


nt 


boundary of C, and we have 


| } log fiz) ae| — log |f(0)| 


x f(z 1 z 
log i) az} {1 [ 18? az} — tog \f(0)| 
. 2 ) |2ai},. z ) wo 

p — log |f(0)}. (3.425) 
J=141414+1, (3.4251) 

where J, and /, denote the parts of the integral corresponding to 
the ares of é and c respectively, and J, and J, the parts corresponding 
to the ares of T in the right and left half-planes respectively. Then 

] 


‘ 2a 27 


7 


0; ; Ay 
I, | log R(pe'®) dé a cos 0 dé +<«(n) =" + €(n), (3.426) 
6; —6 i 


where e(n) > 0 as ny > 0. Similarly, 


1,= P+-€(n). (3.427) 


~ 


Also IT, <3-| log oat Ol) ide). (3.428) 
[t follows from (3.424) and (3.422) that log R(z) is bounded on c, and 
similarly on é. It follows from (3.423) that @(z) is bounded on the 
ares of c and é considered, provided that it is bounded at the begin- 
ning of each arc, i.e. at pe’ and pe-. We may suppose that 

O(z)| < m at pe-*, Then by (3.428) 
[,| : Ho | = 2 on. (3.429) 

. @| * p 
A similar inequality holds for |J,|, provided that ©(z) remains 
bounded at pe”: after z has described the part of the contour D(8) 
E2 
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from pe~’2 to pe’, We have seen that Q(z) is bounded on é and 
therefore at pe-*1, As z moves along the are of T° from pe-* to 
pe’, Q(z) increases by 2p+O(1), but in describing each loop it is 
reduced by 27. It follows from the arrangement of the loops that 
Q(z) is bounded at pe“, and hence 
I5| < Hys/p. (3.4291) 
Now combining (3.426), (3.427), (3.429), and (3.4291) with (3.425), 
we have 
; 2 a H,: 
N(C) = — log p — log |f(0)| +e(n)-+—*, 
7 p 


é 


where H,, depends on 7, and is bounded as p 00. Hence 


2 
N(c)=t— log p — log |f(0)|-+-o0(1). 


7 


It follows from (3.221), and (3.422) and (3.4221) that the number of 
zeros inside c and é is bounded as p > 0, and so 


N(C)—N(p)| < Hy, Max |log (p/r,,)|, 


where 7,, runs through the zeros inside c and ¢. But |r,—p| < 3K, 


so that |N(C)—N(p)| O(1/p). 
The required result follows at once. 

It may be observed that the result could be much improved if the 
approximation in (3.421) as x > o, and therefore the error in (3.426) 
and (3.427), were better. For the error terms arising from ¢ and é 
are O(1/p). 


$(¢) continuous. 


4.1. The methods used for Theorems VIII and IX can also be 
applied when ¢(¢) satisfies a Lipschitz condition, or, more generally, 
when ¢(/) is any continuous function. We define the ‘modulus of 
continuity ’ as follows : * 

Let ¢4(t) be continuous in the interval (a, b); consider any two 
points ¢,, ¢, in (a, b), and let w(5) be the maximum of |¢(¢,)—¢(é,)| 
for all points ¢,, t, such that |f,—t,|< 5. Then w(d) is the modulus 
of continuity of (t) in (a, b). 

w(5) is a continuous, non-decreasing function of 5 and tends to 


* See Ch.-V. de la Vallée Poussin, Legons sur lapproximation des fonctions 
dune variable réelle, Gauthier-Villars (1919), 7. . 
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zero with 8. If w(s)=O(6%), where 0<a<l, ¢(t) satisfies a 
Lipschitz condition of order a. If w(5)—o0(8), ¢(¢) reduces to a 
constant. 

The results for «= 1 are included in those of Theorems VII and 
VIII, but not in that of IX. The natural generalization of Theorem 
VII is ; 

TuHroreM X. If d(t) is continuous in (—1, 1), then all the zeros of 
f(z) lie inside the region \x|< Krw(1/r) for some K. 

The [r]th Fourier coefficient of ¢(t) is of the form Of{w(1/r)}, and 
we have, as we should expect, 


THEeorEM XI. Jf d(t) is continuous in (—1, 1), then 


no) =%84+-000()). 
7  \p/) 


7 


as p>. 


4.2. Proof of Theorem X. Divide* the interval (—1, 1) into 2m 
equal parts, and let 


p p—i__,_P 
P(t) 6(2) ~ byt) ( nm : Py. 


Then 


Suppose that x > 0; then 


m 


prim Pp 


oF 
oe va $ 4) Pol It 


Pp pim Pim 
all e# dt — | e%y,(t) dt 
m (p-1)/m (p-1)/m 

m 


pim l 
ev*~y( — } dt 
(p-1)/m m 
1 


yp —m-+ 


1 ] ot _ p—x 
: o(=)| et di = o( 5) : ; 
m - m x 


I am indebted to Professor Titchmarsh for suggesting that I should sub- 
divide the interval instead of attempting to use fractional orders of integration, 
as I had done previously without success. 
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=... 
7) 4 


where 


: l 1\ e-alm 1 
I,| < prim o(=) = a ) : — < Hygo( =) Z (4.24) 
m mj] 1—e-zim ~ Ams) @ 


provided that m > m,(x). Hence, combining (4.22), (4.23), and (4.24) 
with (4.21), and putting m = [r], we have 


ee ” ae 1 tian {ra(1/r) 95 
file) =e 4( +=) ero {rll . (4.25) 


, 2 


By choosing K sufficiently large we can make 


$(—14+)e*+0(C") <k, 
| m x | 


and so R(z) > te*, (4.26) 


for x > Krw(1/r). In particular f(z) has no zeros satisfying this con- 
dition, and the zeros with large negative real part may be disposed 


of similarly. 
4.3. Proof of Theorem XI. In 4.2, if x >0, we have 


2m—1 


1 
1 
and FARES = -)| : 
, 


Hence R(z) = Ofe*'rw(1/r)}. (4.31) 
The inequality obviously holds also for 2 <0. Hence we can find 
H,, such that 

R(z) <_ eHisro(l r) (4.32) 
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for |x| <10Kre(1/r). It follows from (4.26) that 
R(z) > 4eKroaln (4.33) 
for z= Krw(1/r). 
We denote the line 2 = Krw(1/r) by L, the circle r =p by T, and 
their points of intersection by a, and d@,. We denote the circles 
z—A,| = 2Kpw(1/p), and |z—d,| = 2Kpw(1/p) by ¢ and é, and we 
def fine F(w) as in (3.233). Then by (4.33) and (4.32) 
| F(w)| < eFax ptiip) (4.34) 
for |z—a,| <8Kpw(1/p). Hence we can apply Lemma B with 
r’ = 8Kpw(1/p), k = 2, r” = 2Kpw(1/p), and we have 


| a0| = | d{arg F(u)}| < His po(*), (4.35) 


where the integral is taken round any are of c. A similar inequality 
holds for é. 

Let C, and C, be the contours formed from c, é, and [ as in 
Theorem VIII, and let C denote either C, or C,. As before (3.235) 
holds, where J, is the part of the integral corresponding to the arcs 
of I’, and J, that corresponding to the ares of c and é. It follows 
from (4.35) that 

|I,| < Hy pe(1/p). (4.36) 

I, consists of two parts, J; and Jj. In I, x> Kro(1/r), and in 

I; « <—Kro(1/r). In Jj therefore, by (4.25), 


Lo = 
(1 
fie) =e {14-0(C) + of = e+0), 
where |¢| <4. It follows that 
I; ="siné, + O(1) = £+-0()+0{pux(")}, 
7 7 p 
for 6, differs from $2 by Of{w(1/p)}. Similarly 
7, =2+0()+0{pue(*)). 
Combining (4.36), (4.37), and (4.38) with (3.235) we have 


n(c)=2+-0[pa(-)|. 


and the required result follows as before. 
As in 3.3 we can replace C, and C, by contours formed from 
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the lines 2 = + Krw(1/r) and the connecting arcs of c and €. We can 
also show that the number of zeros for which r < p, y > 0 is 


c+ ofe() 


4.5. In place of Theorem IX we have 
THrorEM XII. If (t) is continuous in the interval (—1, 1), then 


2 ; 1\ ) ee : 4 
N > log op — log!) f(0)| 4 oO} tL ‘(-) +O! -)! - 
(p) =—-— log p og| f(0)|- (eet 7] fae bn ee?) 


as p> &. 
Proof. Since the function F(u) of 4.3 satisfies (4.34) we can apply 
Lemmas B and C with 7’ = 8Kpw(1/p), k = 2, r” = y. We thus obtain 


|{ do} <i H(K)peo(*) (4.51) 


and log R(z) > Hy(K)po(*) 
Pp 


on some circle |z—a,| = y, where 

2K pw(1/p) << y < 4Kpw(1/p). (4.53) 

Similar inequalities hold for some circle |z—d,| = y’, where y’ also 

satisfies (4.53). We denote these circles by c and €é, and construct 
C and D(8) as in 3.42. 

As before, we obtain (3.425) and (3.425 


fil)=e {1+0(7"t ”) =<¢ 


x 


), and by (4.25) we can put 


] 

(1490/2) 

( cos @ /) 
in J,. It follows that 


0; 0, 
I, : a cos 6 dé- O{u(-)| sec 6 d0| 
Qn }_¢ U \p/ ) 6, J 


Pio! wi(*) | +0{ (=) og a 2 
we Ne) ON) Fold) 


since 6, differs from }7 by Of{w(1/p)}. Similarly, 


(4.55) I, 2 +0|pux(")| LO '«(") log — }. 
am  \p/} pp) (1 /p) 
It follows from (4.32) and (4.52) that 
\log R(z)| < Hy. pw(1/p) 
on ¢ and é, and it follows from (4.51) that 


|O(z)| < Hs pw(1/p) 
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on the ares ofc and é considered, provided that the inequality is 
satisfied at the beginning of each arc. This is easily seen to be the 
case by supposing that |@(z)| < 7 at pe~®, and considering its varia- 
tion along D(8). Hence, using (3.428), we have 


I|}< Haspo(*) | < Haspo'(). 
P é Pp 


and a similar inequality for |J,|. 
Combining (4.54), (4.55), and (4.56) with (3.425) and (3.4251), we 
have 
1 


2 : 1\) l 


By (3.221) and (4.34) the number of zeros inside c and € is less than 
H 5, pw(1/p), so that 
N(C)—N(p)| < Hag po( = Max log 
p rp 
where 7, runs through the moduli of the zeros inside c and ¢. But 
r,—p| <3Kpw(1/p), and therefore 
\N(C)—N(p)| = Ofpw*(1/p)}. 
Hence we have the result of Theorem XII. 
1.6. If w(d) > K’dlogs, the second error term may be neglected. 
In this case the result of Theorem XI may be deduced from that of 
Theorem XII. For suppose that k > 1 and nearly equal to 1. Then 


k 
n(p)logk <|°™™) du < N(kp)—N(p) = 


a 
p w 


— 7? (k—1)—logk + Opus) 
7 UU \p/J 


2e(kK—1)_ 41 g{pm*(1/p)) 


and hence n(p) olaak | logk | , 
ms fh 5 


Put k = 1+(1/p); then 


k—1 
7 —er 
logk I+Ofa(l/p} I 


; 2 1 
[t follows that n(p) < =+0 {p(<) | ; 
w  V Ap/] 
The opposite inequality is obtained by considering 


{' n(u) ee 


p k Uu 
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Exceptions to the rule 
5. The analogy between the error term for (p) and the order of 
the Fourier coefficients does not by any means always hold. This is 
because the behaviour of ¢(¢) in the middle of the interval (—1, 1) 
has little effect on |f(z)| when |x| is large. In fact we have 
THEOREM XIII. Jf (t) is continuous in (—1, —1+-y) and (1—7, 1), 
then the results of Theorems X, XI, and XII hold, provided that we 


replace w(1/p) in each result by 


(5) = Max 'o(2): loge) 
p ( \pe/ pp ) 


For x > 0 we write 


1—y 1 
fe) =| e*d(t) dt "| e*4(t) dt = 1,+-1,. 
— 1-7 


ins I.=©)1- po(™ (I my! 
f { x ) 


It follows from Lemma A that 


ov 
I, — o{e—97} = O 5 of 
r 


for x = (logr)/y, and a fortiori 
L— oe lost r) 
. | raf’ 
5.% 


for x > (logr)/n. Combining (5.2) and ( 


file) =e 1+0 (mae) | 


(5.3) 


3) with (5.1), we have 


a 
The result of Theorem X follows as before with w, instead of w. 
In order to obtain the results of Theorems XI and XII we have 
in addition to use (4.32), i.e. we have to show that 
R(z) << ef yorw(1/r) (5.4) 
for |x| < 10Krw,(1/r). We need only consider the integral from 
—1+y to 1—». For the usual arguments show that 
1+7 to l—»n. F g 
\z I,| “ eA o7rwyl r) (5.5) 
and a similar inequality holds for the integral from —1 to —1-+-y 


For the values of x concerned we have 


z | : ed (t) dt = ofre'=!4-n)} (5.6) 


— ofelOKA—n)r,(t r)+ logr} . ofe esr, m}. 
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Combining (5.5) and (5.6), we have (5.4). The rest of the proofs of 
Theorems XI and XII can now be applied without further difficulty. 

[t remains to give an example of a function for which the analogy 


fails. Consider 
@ 


e2nit 
Wo=>S. 


m=1 
The series is uniformly convergent for |t| <7, and therefore it is the 
Fourier series of its sum. But, when 7 is of the form 2”, 


eo ee log 2\? 
"m=? \logn/ ° 


o(t)=1 (—l<t< —1+7) 


=4(5" ) (—1+ <t< 1-7) 
— 


Now let 


== § (l—9» <t< 1). 
Then by Theorem XIII 
9 
n(p) == + O(log), 


but, when [p] = 2”, 


log 2 \? 
“el ~ \log [pl ’ 


and so the analogy fails. 











A GENERALIZATION OF THE QUADRATIC 
DIFFERENTIAL FORM 

By OSWALD VEBLEN 

[Received 28 February 1930] 
Wiri each point x = (x! 2? ... x”) of a space of nm dimensions there 
is associated a flat space of n dimensions whose co-ordinates are the 
sets of differentials dx = (dx! dz? ... dx”). We shall refer to these 
spaces in which the co-ordinates are sets of differentials as tangent 
spaces. They are affine spaces because their co-ordinates undergo an 


affine transformation, 
dit = vida, where vi=>—, 


whenever the co-ordinates x of the ‘underlying space’ are subjected 
to an arbitrary analytic transformation, 
x &* (a). 
The group in the tangent space is, indeed, the subgroup of the affine 
group which leaves the origin dx = 0 invariant. This point, dx = 0, 
will be called the point of contact of the affine space with the under- 
lying space. 

A quadratic differential form determines in each tangent space 


a quadratic cone, ieee 
sca a,dx'dal 0. 


with its vertex at the origin. The generalization which we propose 
amounts to replacing these cones by non-degenerate quadrics, 
a, dx'dai +-a,dx' +d, = 0. 

The theory of these quadrics is a chapter in the generalized pro- 
jective geometry.* On account of the relationship between this 
theory in » dimensions and the generalized affine geometry of n+1 
dimensions, the formalism runs closely parallel with that of the 
quadratic differential form and many results can be taken over 
directly. Moreover, the formulas in four dimensions are in part 
identical with those of the Five-dimensionai Relativity of T. Kaluza, 

* For an introduction to this subject and references to literature by Weyl, 
Cartan, Schouten, Eisenhart, Thomas, and others, see a paper entitled 


‘Generalized Projective Geometry’ in the Journal of the London Mathematical 
Society, vol. 4 (1929), pp. 140-60. This paper will be referred to below as G.P. 
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O. Klein, and others. This relationship will be dealt with in a paper 
which is being prepared in collaboration with Mr. B. Hoffmann, but 
the present paper does not presuppose any knowledge of it, and such 
formulas as we require are either derived in the text or taken from 
well-known general theory. 

|. Homogeneous co-ordinates in the tangent spaces. In each 
tangent affine space points at infinity can be introduced in the usual 
manner so that it can be regarded as immersed in a tangent pro- 
jective space. The points o* the latter will be designated by means 
of homogeneous co-ordinates having a factor of proportionality which 
we shall write in the form exp(—z®). The parameter 2° (which is 
really the negative of the logarithm of the factor of proportionality) 
will be referred to as ‘the factor’—following a convenient terminology 
introduced by, Mr. Whitehead.* 
The invariant theory of homogeneous co-ordinates must take 
account of a change of factor, 
£° = 2°-+-log p(z* a* ... 2), (1.1) 
as well as of a change of co-ordinates, 
(1.2) 
The projective tensors defined in G.P. are a particularly simple class 
of invariants with respect to these transformations. One of the 
simplest projective tensors is a projective scalar of index 1}. This 
is an invariant which has one component, 
A =e®f(x! x? ... x”), 


for each co-ordinate system and factor,{ and whose law of trans- 


formation is Lu 


Thus in each co-ordinate system a projective scalar has an infinity 
of components, for a change of factor (1.1) changes A into 


J. H. C. Whitehead, ‘On a class of projectively flat affine spaces’. This 
paper will be referred to below as P.F. 
[ am using the word ‘index’ instead of ‘weight N’ which was used in 
t.P., so as to reserve the word weight for the ‘weight M’ of that paper, 
. the weight of a relative tensor. 
A co-ordinate system and factor together constitute a representation 
(x) according to Mr. Whitehead, P.F. § 3. 
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In one representation it is clear that we will have 
A=&. 
Let us make the convention that Greek indices run over the range 
n, whereas Latin ones take on the values 1, 2, ..., » only. 
Let A be the component of the projective scalar A in the representa- 
tion (°)-+-(#), obtained by a change of co-ordinate system as well as 
of factor from the representation (x°)-+-(x). Then 
0A 2A éxb 
ez* — axB ex" 
By reference to G.P. it is evident that @A/éx* are the components 
of a projective vector of index 1 and 


log A 
= at (1.3) 
are the components of a projective vector of index 0. Since 
0A/éx®° = A, we have A,= 1. 

We now introduce homogeneous co-ordinates in the tangent spaces 


by the formulas, 
. axt 
Xt=—, (1.4) 


The co-ordinates X* so defined satisfy the condition 
] 
mi 
A,X*= 7° 
The tangent affine space becomes a projective one, if we admit ideal 
5 
points whose co-ordinates satisfy the equation of the hyperplane 
A, A* = 90, (1.7) 
With this convention every set of »+1 numbers (X° X!... X”) is 
a set of homogeneous co-ordinates of some point of a tangent pro- 
jective space. Moreover, the homogeneous co-ordinates are trans- 


formed by the rule, 
ana 


CO 
oxB ’ 


under an arbitrary change of factor and co-ordinate system. Hence 


X« — XB 


they are components of projective vectors. Since A appears in the 
denominators in (1.4) and (1.5), these projective vectors are of 


index —1. 
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This introduction of homogeneous co-ordinates is made in such 
a way that the coefficients of the equation of the hyperplane at 
infinity in each tangent space shall be the components of the pro- 
jective gradient of A. We could if we wished replace A, by an 
arbitrary projective vector B,, so that the equation of the hyper- 
planes at infinity would be 

B. X*=§@, 
But in this case there would not exist a representation in which the 
equations of the hyperplanes at infinity of all tangent spaces are 
X°=0 


simultaneously. When A, is a gradient this happens in any repre- 


sentation for which 4 


— 


2. Equation of a quadric in the tangent space. The com- 
ponents of any covariant projective vector B, are the coefficients of 
the equation of a hyperplane in each tangent space, 
B_A*=xG, 
This equation represents the same hyperplane no matter what repre- 
sentation of the vector B, is used, for the left-hand member is 
unaltered by any change of representation. In non-homogeneous 
co-ordinates the equation is 
(B,A;— B;)da*t = By. 

In like manner the components of any covariant projective tensor 

Gg are the coefficients of a quadric, 

Gp X°XP = 0, 

in each tangent space. Since the left-hand member is unaltered by 
changes of representation, any point X which satisfies this equation 
in one representation does so in all representations. Hence there is 
a unique quadric determined by the projective tensor G,,— always 
with respect to a definite choice of the scalar A which determines the 
relation between homogeneous and non-homogeneous co-ordinates. 
In non-homogeneous co-ordinates the equation of the quadric is 

(G,; —A,@q;— A; Goi +A Aj Go9)da'da) 4- 2( Gg; — Gq A ;)dx' + Gog = 0. 

3. Projective tensor of second degree. Let us denote the index 
of an arbitrary covariant projective tensor G,, of degree 2 by 2N. 
From the law of transformation of such a projective tensor we see 
that Go. is a projective scalar of index 2N. Let us write 

p? = Goo: 
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so that © represents a projective scalar of index N. Then 
G 
Yop — GC 


700 


0B 


is a projective tensor of index zero such that 
Yoo = I. 
Again, from the law of transformation, if we define 
Px Yao? 
#, is a covariant projective vector of weight zero. Hence 
I up Yap —$ 8 
is a projective tensor for which 
J x00 = Jou = Y- 
Again, from the law of transformation it follows that the quantities 
g;; are the components of an affine tensor. Thus the projective tensor 
Gg determines and is determined by a projective scalar ®, a pro- 
jective vector ¢,, and an affine tensor g;;, according to the formula, 
Y 9 . 
G 75) P°(J a8 I pa Pp): (3.1) 
The geometrical meaning of this equation is easily seen. The point 
g g | 
of contact of the tangent space with the underlying space has homo- 


geneous co-ordinates . ae 
‘ a x e-4 do 


Hence an equation of the polar hyperplane of the point of contact is 


dp x X ; 0, 
and an equation of the tangent cone with this point as vertex is 
Jag X*XP = 0. 
In non-homogeneous co-ordinates these equations become 
(¢;—A,)dx'+1=0 
and g,dx'dai = 0 
respectively. 
Referring to the equation at the end of the last section, we see 
& 1 
that the equation of the quadric in non-homogeneous co-ordinates is 
a | ‘ ot _| Q29 
(Gj; p; )dar'dai +- 2,da'+-1= 0 (3.2) 
or gijda'da) +- (b,dx'+-1)? = 0, 
where ys; is an affine vector defined by the equations 
wb; = $;—A;. (3.3) 
Since ¢ 4,= 
; 0 —_ ? 
Pe P $y—Ay 


defines a projective vector whose null component is zero. 
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Let us denote the (n-+-1)-rowed determinant of the quantities y,g 

by y and the n-rowed determinant of the quantities g,; by g. Then 
a simple computation shows that 

y¥=9.- (3.4) 

We also consider the contravariant affine tensor g defined by the 


equations 


Ii = 9}, 
as well as the contravariant projective tensors y*? and Gf defined by 


yy... = 82 and G~°G,, = 82. (3.4) 
1 
Obviously 798 = ge” (3.5) 


and y*? and G8 are of indices 0 and —2N respectively. 

For all affine tensors we shall raise and lower indices in the usual 
manner by means of g/ and g,,, and for projective tensors by means 
of y*? and y,g. Thus, for example, 

Yap>6 = , and hence $* = 85. (3.6) 
It is a corollary of these equations that 


yb. dp =1. 
If we make the definition 


1/4, %, ' 
Por = a éxe)” (3.7) 
it follows that $00 = Poo = 9, 
and it is evident that ¢,; is an affine tensor. Hence we raise and 
g lower its indices by means of the g’s. Thus 


i___ 1lyik Opp od; 
j=1g Sat) 


Oa 


and so di = 0. 
We adopt the convention that, where the contrary is not stated, it 
is always the first index which is raised or lowered. This determines 
the sign of ¢}. In general, raising and lowering Greek indices amounts 
geometrically to polarizing with respect to the quadric 

Yap X*X — 0, 
and raising and lowering of Latin indices amounts to polarizing with 
respect to the degenerate quadric 

g,jda'dxi = 0. 

Tr 
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The relation between y*? and the affine tensor may be found as 
follows : By (3.4) and (3.6), 
8 = YY a = YI Gort abe) = ¥Gjx- 
Hence we have yi = gil, (3.8) 
Again, 0 = y™"y.9 = 9‘ bj; +y™. 
It follows that yo = y! = —giid.. (3.9) 
We do not write gi‘¢;=¢' because the quantities ¢’ are not the 
components of an affine tensor. Finally 
1 = YY 0 = —9"$j bi ty”. 
Hence y = 1+ 94; ¢;. (3.10) 
The affine tensor g;, determines an affine connexion such that 
Ji = Y, 


where the comma denotes covariant differentiation. The components 
of this affine connexion are given by the Christoffel formulas 


Cin} = g"[a.jh] 


: ° l Of aj | Of ak OF ire 
in which [a,jk]= eit Fak _ ZH). 


Crk Oxi = ax 

4. The projective connexion [. There are two projective 
connexions which are determined in a formally obvious way by the 
tensor Gg, namely, the projective connexion I" such that the pro- 
jective derivative of y,g with respect to it is zero, and the projective 
connexion II such that the projective derivative of Gg with respect 
to it is zero. In this section we shall derive some of the formulas 
relating to [.. We indicate projective differentiation with respect to 
I’ by a semicolon, so that 


= OY ab 


Yapiy = Oxy —Yao By Yop | ty = 0. (4.1) 


These equations can be solved in the usual way to give 


; syne f OY oy _ es) 


Oxy oaB Oxue 


_ vam ea Joy _ ze) ) 


" Gay  OxB = Oxe 


. oa Ode | ad, x 
Li me a *) +Y"(Poy bet bap by)» 
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¢,, being defined by (3.7). It is immediately obvious on using 
formulas such as (3.6) that 

Tag = tao} 5p- (4.3) 

If we substitute 8B = 0 and y = 0 in (4 a it is clear that 

ry. = 0. (4.4) 
Making the same substitution in (4.1) we have 
$,1% = 0. (4.5) 
Substituting 8 = 0 and y = 0 separately in (4.1) we have 
bay = 8 4.0%) = Yao hy 
and Yop P20 = —Ya0 DB: 
From these two sets of equations together it follows that 
Posy = —P ya 
and hence that* Pap = Pap: (4.6) 
Returning to (4.2) we find 
Po = yPay = ¥ iy, (4.7) 
which yield yA 43 and I’, = —4¢; 4}. (4.8) 
It also follows from (4.2) that 
Ye = Gid+ (bj be +-$ie$5) (4.9) 
and I). : —$; ‘kts (zt + ser). (4.10) 
The projective curvature tensor (cf. G.P., p. 156) for the con- 
nexion T° is given by the formula 
Ct ee oy ey eo 
Since Igo # 8B; 
its components do not vanish whenever one of the subscripts is zero, 


but it does satisfy 
Br, =0 (4.12) 


Hence the following equation defines an affine tensor 
Bin = $7 4i.. (4.13) 
On contracting (4.11) with respect to « and y we obtain a projective 
tensor analogous to the Ricci tensor, 
Bag = RiBidh—4%,(8.bg+34 ba) +245 bert b1¢i baby (4.14) 
* For this relation I am indebted to Dr. W. P. Allis. 


sz 
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where R;; represents the Ricci tensor (contracted curvature tensor) 
formed from {i,}. From this we get at once the scalar, 

B= y*#B,pg = R—4 $5, (4.15) 
where R is the scalar curvature. The formula (4.14) also yields the 
affine scalar Bu= 4d, (4.16) 


the affine tensor Bi = Ri+29%¢' gh, (4.17) 
and the affine vector 
. ** C is | 
Bin =e = | ED) (4.18) 
= Ng Ct 
5. The projective connexion II. The other projective connexion 
referred to at the beginning of section 4 is the one for which 
me oG Y oa 
oply = oe Gq 113, — Gog Tg, = 0. 


e 


G 


The solidus will be used to denote projective differentiation with 
respect to II. For y= 0, (5.1) gives the relation, 
Yao I fo+Yop ITZ = 2NY og: 
On account of the relation 
Gig = Dy 08; 
the formal work of computing the components of II will be similar 
to that which arises in conformal geometry. Expanding equations 


(5.1) we have ‘ Lah 
209, ¥.g +P Yap, y ==. 


1 dlog® 2 
a= 1s 9 


We may write 


the factor 1/N being introduced so that we shall have 
D, = ] P 
and the equations defining II reduce to 
Yopiyt2N og PD, = 0. (5.4) 

Now we use the fact that the difference between any two pro- 

jective connexions is a projective tensor, and so write 
i 1 sym 
IT, = | By T'p,,. 

The determination of IT3,, therefore, can be reduced to the computa- 
tion of the components of 7'g,. If we expand (5.4) we now find 


OV Yo a la to | OW. = 
( — Yop! Za Yael fa) — Yop PEAY a0 VB t+2Ny ag?) = 0. 
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The terms in parenthesis add up to zero because of (4.1). Hence we 
have algebraic equations for 73, which, solved in the usual manner, 
a Th, = N(8} 0, +83 Og—y ng). (5.5) 

In some of the applications it is desirable to have particular sets 
of components of IT}, explicitly in terms of g, ¢, and ®. In order to 
write these formulas conveniently, we define 

6, = $4—Py. (5.6) 
Thus 6, = 0, and 6; is an affine vector, and we shall raise and lower 
its indices by means of the g’s. From 
Oo = y+ yO, 
it follows by (3.8), (3.9), and (3.10) that 
o' = —# and ©° = 1+ 6g. | (5.7) 

In like manner, O*®, = 14 6°6;. (5.8) 

Thus we find T°, = N(1—6%d,), Tin = NO, (5.9a) 
Tio = —N(6;+-9; 9,%), jo = N(8;+-6'4,), (5.9b) 

Tie = —N Gj +$; bx 1+ 6'9;); (5.9e) 

in = N[8}O, 4-8), D,+-6G;.+-4; dy)]- (5.9d) 

There are also some formulas connecting projective derivatives 
with respect to [ with covariant derivatives with respect to {i,}. 

oD 
Thus, since ,,,—= __*— T'¢g@,, we get 
us, since ®y.g= > g®,, we ge 
Poo = 0. (5.10) 
Also by reference to (4.8) 


a i 
fal] PRY 


oxt Yi 


a 


Oy, = COT + TZ, 0° = — 
ox 


Thus, if we use commas to denote covariant derivation with respect 


to (its 2(Ai./, 
se acta cee ee, (5.11) 
' Ng ox 
Again, using (4.8) o', = —4} 6. (5.12) 
And also, y7 :, a gi*($;,—8',) =— = 39°"g?? (Ba, +9, a) (5.13) 
We now calculate the curvature tensor formed out of the IT, which 
we shall denote by P%,5. Since 17g, = 1g,+7'g,, we have 


Pays i Bay3+ T by3— Te8y+ Th, T os— Teel oy (5.14) 
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where B3,5 is, as previously defined, the curvature tensor formed 
from the I'3,, and 7’,.5 denotes the covariant derivative of 72, with 
respect to the Ig By" Using (5.5) we have 


P Byd ° Bag t+ NS KPp. 3— N®, 0;+NO°®, pst (5.15) 
—N85{Op,,—NO,9,+NOO,y,,} 
—Nypg {02 — DOs} 
+N ypga{O?, —0°9,}. 
On contracting with respect to « and y we obtain the generalized 
Ricci tensor, 
Pes = Bos+-N(n—1){Ogs—NOgD3+ NOD, yp5}-+ Nyps OZ, (5.16) 
and the scalar, 
P = B—2Nn6';,+ N2n(n—1)(1+0°0,). 
From (5.16) we also get the affine invariants, 
P4 yiyBP ip (5.18) 
: BI+N?(n—1)[g4(1+6,0,)—0'0)]— 
—4N (n—1)g'4g(8,, 5 +84,«) —Ngg?y 
and Pi an P*P (5.19) 
= Bi+N(n—1)[NO'—¢gif'), 
and Poo = Bot N2(n—1)0; 0'—NO',. (5.20) 
6. The flat case. The flat projective geometry is the theory of 
a projective connexion which in a particular representation (cf. G.P., 
p. 159) has the components 
IT p, = 88 8),+ +8), 3g —85 92 OY. (6.1) 
The differential equations 
ox 
g8 
are completely integrable and have solutions of the form 
A* = pe (q)X8, (6.3) 
in which the X,’s are arbitrary constants multiplied by exp(—q°). 
The X,’s can be regarded as homogeneous co-ordinates in the tangent 
space at a fixed point, say y= 4q , of the underlying space and the 
X’s as homogeneous co-ordinates in the tangent space at a variable 
point, y=q. The equations (6.3) then represent a projective trans- 
formation from the one tangent space to the other. 


Gp X° = 0 
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But, as is proved at the end of G.P., the X,’s and X’s which 
satisfy (6.3) can be expressed as functions of y, each such function 
satisfying 
Bx. —IIgg oe 8. (6.4) 
dy*oyB dy? 
In other words, if n+ 1 independent functions X, satisfy these equa- 
tions, they define a transformation from the underlying space to the 
tangent space at y= q, and, moreover, the functions X related to 
X, by (6.3) also satisfy (6.4) and define a transformation from the 
underlying space to the tangent space at y=gq. Hence the X,’s and 
the X’s can both be regarded as homogeneous co-ordinates for the 
underlying space, and (6.3) thus defines a projective transformation 
of the underlying space into itself. 

This may be summarized by saying that in the flat projective 
geometry all the tangent spaces can be regarded as coinciding with 
one another and with the underlying space. 

If we take q, to be (0 0 ... 0) we find 


Xo = e” and X! — evyi (6.5) 


as solutions of (6.4), and the co-ordinates y are cartesian. The equa- 
tion of a quadric in cartesian co-ordinates is 


a;; yy! + 2dp; Y! +49 = 9. (6.6) 


In the homogeneous co-ordinates X, this becomes 
0 


Ang Xe X8 — 0. (6.7) 
More generally, we can replace (6.5) by 
X° = e”-? and X‘ = (y'—q')e”-, (6.8) 
or X° = X8e-@ and X‘ = (Xi—q'X$)e@. 


When the X’s are defined in this way, which is a special case of (6.3), 
they satisfy (6.2) as functions of g and (6.4) as functions of y. The 
X’s can be regarded as homogeneous co-ordinates in the tangent 
space associated with the point g and factor g°—or, if we like, as 
homogeneous co-ordinates in the underlying space, so chosen that 
the point whose homogeneous co-ordinates are proportional to 85 has 
cartesian co-ordinates (gq! q® ... q"). 
The equation of the quadric (6.7) is 
05; YY) + aig ¥' +400 = Gily'—F' LY —@) +2 (Gin + Gis PY —G') + 
+ (Ao + 249 9° +44 9'7') = 0. 
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Hence in the variable homogeneous co-ordinate system defined by 
(6.8) the equation of the quadric is 

Gg X°XP = 0 (6.9) 


-_——_ ~ | ‘yY  .Oes | ; 
Gj; = e'a,;,, GC, =e (Aig t+ iy), 


in which (6.10) 


| 2. Ww g2q? 9 i_} ‘gins 
Grog = €78"(dgg + 20g Y' +44; 9'9"). 


The X°’s as functions of g are the components of projective vectors 
of index —1. Hence the left-hand member of (6.9) will be invariant, 
if the functions Gg are the components of a projective tensor of 
index 2. This tensor in each representation gives us the coefficients 
of the equation of the quadric in the projective co-ordinates asso- 
ciated with the point q. 

It is easy to verify that the projective derivative of the projective 
tensor Gy is zero, i.e. that 


Gob _@T1e, Gg 11 6.11 
oxy —Ungg By a ‘of ay = @, ( ). ) 
We have merely to substitute (6.10) and (6.1) in these equations. 
Thus we see that the theory of a quadric in flat projective geometry 
may be regarded as the theory of a projective tensor of degree 2 and 
index 2 whose projective derivative with respect to a flat projective 
connexion is zero.* 

Since the projective connexion II of section 5 is now given in one 
representation by (6.1), we infer from (4.4) and (5.9a) that 

Pie ¢ - ; 9 
‘=Oand ¢,=—Q,. (6.12) 
Hence the equation of the quadric in the homogeneous co-ordinates 
associated with any point x = q can be written 
Iij X‘Xi+ (® x X - = 0, 
which corresponds to the fact that 
cr 
M, X 0 

* Obviously the equation (6.9) can be multiplied through by any factor of 
the form exp(Mq’) without affecting its validity as an equation of the quadric. 
Therefore the quadric can be represented by a projective tensor of degree 2 
and any index. But, if the index is not 2, the equation (6.11) must be re- 
placed by 

2 Y y Dron cee 
(6.1lla) GoBiy—G xpioPy = 9, 


in which B, is the projective vector whose components, in the representation 
for which the components of II are given by (6.1), are 
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is the equation of the hyperplane polar to x = q, and 
is the equation of the tangent cone through x = gq. 

Moreover, gdq'd¢ 
is the fundamental differential form of the non-Euclidean metric 
with the quadric as absolute. This is to be expected, because 
this metric is determined in an invariant way by the quadric. 
[t can be proved in the following manner. Consider the cartesian 
co-ordinates 

xi 
=O, x" 
In these co-ordinates the equation of the quadric is 
G77 +1=0. 

Hence the quadric is a sphere of radius 1 in a Euclidean space whose 
hyperplane at infinity is the polar of the point z=0. The funda- 
mental tensor of the Euclidean space is g;; at z= 0 and its com- 
ponents are constant in the co-ordinate system z. Thus a cross-ratio 
of two points z'= 0 and z‘ = dz‘ with the two points in which the 
line joining them meets the quadric is 


zt 








14 Nyy dada’ 
hw Vg;; da'dax) 


The Cayley definition of non-Euclidean distance is ylogR which 
gives ds? — 4y%g gdg'dgi 

for the element of distance. Hence g;; is the fundamental tensor of 
the non-Euclidean metric with the quadric as absolute; and this 
metric at any point is identical with a Euclidean metric whose abso- 
lute is the intersection of the polar hyperplane of the point with 
the quadric. 

The values of the Christoffel symbols of the non-Euclidean metric 
come out at once from the general theory. For by (4.9) and (5.9d) 
we have 

0= Mj, =F} +, +3,%,) 
and in this representation, 


D = e® Nagy + 2aigq' +4; 7'7- 
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In like manner we deduce from (4.10) and (5.9c) that 
70 _. 2D; : 
0=Nh.= ~ D {ind —Ijn—; D;, 
1 @&® 
® oxi dak’ 


em, | 
or Ijx = é yk — ®, ®,. = 


These remarks on the non-Euclidean geometry should be compared 
with those of Mr. Whitehead in P.F. 

7. Infinitesimal projective displacements. Any projective 
connexion can be interpreted as determining an infinitesimal dis- 
placement of the tangent spaces of the sort studied by Cartan, 
Schouten, and others. We have explained this for the flat case in 
the section above as an interpretation of the equations 

X'e 0, (7.1) 
in which X°% is of index —1. The same equations can be used for 
the geometry-of-paths case (cf. G.P., p. 150 et seq.), that is, for any 
connexion A such that 

Ago = 9p. (7.2) 
But in general the projective connexions T° and II do not satisfy (7.1) 
when 8=0. We must therefore use another set of equations, if we 
are to interpret these connexions in terms of projective displace- 
ments. We turn to the equations 

X, =X), Bg, (7.3) 
in which Bg is a projective vector such that 

m= i. 

The latter condition insures that the equations shall be satisfied when 
8B=0. The equations (7.3) expand into 
oX* 


aap + Hep X°+ By X°—M 


a0 


X° Be - 0 


which are of the form ae 


oxB + AG pX°? Q, 


where AGp = Wop—W oo Bg +85 Bp. (7.5) 
The quantities A}, are the components of an unsymmetric projective 
connexion for which (7.2) is satisfied. 
In general, the equations (7.4) are not integrable. But, along an 
arbitrary curve 2’ = a‘(t), we have 
dX” dxi 


apt Acs gy X°=9 
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which, being linear, integrates into 
X* =x P3(t)X8, (7.6) 
where X8 are arbitrary constants. We may interpret X8 as homo- 
geneous co-ordinates in the tangent space at the point t= 0 and X* 
as homogeneous co-ordinates in the tangent space at any other point 
of the curve. The equations (7.6) then represent a projective trans- 
formation of the first tangent space into the second. In this sense 
we may say that 
dX*+-At, X°dxi = 0 (7.7) 
represents an infinitesimal projective transformation of the tangent 
space at any point x into that at any point «+dz. 
The equations (7.7) represent a single infinitesimal transformation 
not, as one might expect, an infinite family of them. For, if we 
make a change of factor, the components Aj, are changed, but so 
also are the components X* which represent a point of the tangent 
space, and the expressions on the left-hand side of (7.7) are trans- 
formed as the components of a projective vector. 
Let us write (7.7), using V for the non-homogeneous co-ordinates 
of any point in the tangent space and dz for the co-ordinates of the 


a 
Vi = AX'= 37, 


displacement. So 


and (7.7) becomes 
dV‘ (Ajy,—Aj, A,,—A, Aj; 8).)V¥dai + (7.8) 


] 


ox! - 


+Vi (‘ Ay —A, Ajj; +A, A, Ag ) V¥dai +-Aj; dai = 0. 
It can be verified directly, if desired, that this expression is unaltered 
by a change of factor. 
[t is always possible to choose the factor so that Ay = 1 and A; = 0. 
In this representation (7.8) becomes 
dV! + Aj, V¥dai —A0Vidai—AP.V'V¥dai+ Ai, dai = 0. (7.9) 
We now ask what happens to the quadrics 
Gg X*XP — 0, (7.10) 
when the tangent spaces undergo the projective displacements deter- 
mined by (7.7). If we differentiate the left-hand member of (7.10) 
and substitute (7.4), we obtain 


ae oe 
( — 8 _ Gop AS, —Gag A /)x XB, 


B 


OX 
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Any one of the quadrics will be transformed into another one, if these 
quantities vanish, when and only when (7.10) is satisfied. This will 
happen if 
0Gub_@ Ag —G.. At = Gg C (7.11) 
Oa” opty Tao ** By ~ Sap y ; 
for some projective vector C,. On substituting (7.4) in these equa- 
tions and using (5.1) and (5.2), we obtain 
2(N—1)B,=C,, 
which determines C, so that (7.11) is satisfied. Hence any tangent 
quadric is transformed into another by the infinitesimal projective 
transformations determined by II according to the formulas (7.3). 
The discussion in this section up to (7.11) can be carried through 
unaltered, if the projective differentiation in (7.3) is taken to be with 
respect to I" instead of II. Let us, then, substitute the equation like 
(7.5), with T’ replacing IT, in (7.11) and obtain 
—2B,=C,,. 
Hence the tangent quadrics are transformed into themselves by the 
infinitesimal projective transformation determined by I" according 


to the equations 


X* = X% B,. 
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